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Polymorphisms and New Genes in the Region of the Human Hemochromatosis 

Gene 

5 BACKGROUND OF THE INVENTION 

Hereditary hemochromatosis (HH) is an inherited disorder of iron metabolism wherein 
the body accumulates excess iron. In symptomatic individuals, this excess iron leads to deleterious 
effects by being deposited in a variety of organs leading to their failure, and resulting in cirrhosis, 
diabetes, sterility, and other serious illnesses. The gene which is defective in this disease was 
10 disclosed in copending U.S. S.N. 08/652,265. 

Fine structure mapping of the region to which the gene responsible for HH, HFE 
(denoted HH or HFE in some publications), was mapped makes possible the identification of candidate 
sequences comprising the HFE gene, along with structural elements for regulation and expression and 
neighboring genes. 

1 5 A variety of techniques is available for fine structure mapping, including direct cDNA 

selection, exon-trapping, and genomic sample sequencing. The direct selection approach (Lovett et a/. 
Proc. Natl. Acad. Set. U.S.A. 88:9628-9623 (1991)) involves the hybridization of cDNA fragments to 
genomic DNA. This technique is extremely sensitive and capable of isolating portions of rare 
transcripts. Exon-trapping (Church et al. Nature Genetics 6:98-105 (1994)) recovers spliced introns 

20 from in vivo expressed genomic DNA clones and produces candidate exons without requiring any prior 
knowledge of the target's gene expression. High-throughput genomic DNA sequencing with 
comparison of the sequence data to databases of expressed sequences has also been used, such as 
in the positional cloning of the Werner syndrome gene (Yu et al. Science 277:258-262 (1 996)) and in 
cloning by homology of the second Alzheimer's disease gene on chromosome 1 (Levy-Lahad et aL 

25 Science 269:973-977 (1 995)). 

HH is typically inherited as a recessive trait: in the current state of knowledge, 
homozygotes carrying two defective copies of the gene are most frequently affected by the disease. In 
addition, heterozygotes for the HFE gene are more susceptible to sporadic porphyria cutanea tarda 
and potentially other disorders (Roberts et al., Lancet 349:321-323 (1997). It is estimated that 

30 approximately 1 0-1 5% of Caucasians carry one copy of the HFE gene mutation and that there are 
about one million homozygotes in the United States. HH, thus, represents one of the most common 
genetic disease mutations in Caucasian individuals. Although ultimately HH produces debilitating 
symptoms, the majority of homozygotes and heterozygotes have not been diagnosed. 

The need for such diagnostics is documented, for example, in Barton, J.C. et al. 

35 Nature Medicine 2:394-395 (1996); Finch, C.A. West J Med 153:323-325 (1990); McCusick, V 

Mendelian Inheritance in Man pp. 1882-1887, 11th ed., (Johns Hopkins University Press, Baltimore 
(1994)); Report of a Joint World Health Organization/Hemochromatosis Foundation/French 
Hemochromatosis As sociation Meeting on the Prevention and Control of Hemochromatosis (1993) 
Edwards. C.Q. et al. New Enal J Med 328:1616-1620 (1993); Bacon, B.R. New Enol J MpH 326:126- 
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127 (1992); Balan, V. et al. Gastroenterology 107:453-459 (1994); Phatak. P.D. et al. Arch Int Med 
154:769-776(1994). 

A single mutation in the HFE gene, designated 24d1 in copending U.S.S.N. 
08/630,912, gave rise to the majority of disease-causing chromosomes present in the population today. 
5 This is referred to herein as the "common" or "ancestral" or "common ancestral" mutation. These 

terms are used interchangeably. It appears that about 80% to 90% of all HH patients carry at least one 
copy of the common ancestral mutation which is closely linked to specific alleles of certain genetic 
markers close to this ancestral HFE gene defect. These markers are, as a first approximation, in the 
allelic form in which they were present at the time the ancestral HFE mutation occurred. See, for 

10 example, Simon, M. et al. Am J Hum Genet 41 :89-105 (1987); Jazwinska, E.C. et al. Am J Hum Genet 
53:242-257 (1993); Jazwinska, E.C. et al. Am J Hum Genet 56:428-433 (1995); Worwood, M. et al. Brit 
J Hematol 86:863-866 (1994); Summers, K.M. et al. Am J Hum Gen*t asai-ar (1989). 

Several polymorphic markers in the HFE region have been described and shown to 
have alleles that are associated with HH disease. These markers include the published microsatellite 

15 markers D6S258, D6S306 (Gyapay, G. et al. Nature Genetics 7:246-339 (1994)), D6S265 (Worwood. 
M. et al. Brit J Hematol 86:833-846 (1994)), D6S105 (Jazwinska. E.C. et al. Am J Hum Genet 
53:242-257 (1993); Jazwinska. E.C. et al. Am J Hum Genet 56:428-433 (1995)), D6S1001 (Stone, C. 
et al. Hum Mplec Genet 3:2043-2046 (1994)), D6S1260 (Raha-Chowdhury et al. Hum Molec Genet 
4:1869-1874 (1995)) as well as additional microsatellite and single-nucleotide-polymorphism markers 

20 disclosed in co-pending PCT application WO 96/06583, the disclosure of which is hereby incorporated 
by reference in its entirety. Additionally, copending U.S.S.N. 08/630.912 disclosed additional markers 
24d2 and 24d7. 

The symptoms of HH are often similar to those of other conditions, and the severe 
effects of the disease often do not appear immediately. Accordingly, it would be desirable to provide a 

25 method to identify persons who may be destined to become symptomatic in order to intervene in time 
to prevent excessive tissue damage associated with iron overload. One reason for the lack of early 
diagnosis is the inadequacy of presently available diagnostic methods to ascertain which individuals are 
at risk, especially while such individuals are presymptomatic. 

Although blood iron parameters can be used as a screening tool, a confirmed 

30 diagnosis often employs liver biopsy which is undesirably invasive, costly, and carries a risk of mortality. 
Thus, there is a clear need for the development of an inexpensive and noninvasive diagnostic test for 
detection of homozygotes and heterozygotes in order to facilitate diagnosis in symptomatic individuals, 
provide presymptomatic detection to guide intervention in order to prevent organ damage, and for 
identification of heterozygote carriers. 

35 Furthermore, a need exists for both methods for fine structure mapping and a fine 

structure map of the region of the chromosome to which the HH locus maps. This and other needs 
are addressed by the present invention. 
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SUMMARY OF THE INVENTION 

One aspect of the invention is an oligonucleotide comprising at least 8 to about 100 
consecutive bases from the sequence of Figure 9, or the complement of the sequence, wherein the at 
least 8 to about 100 consecutive bases includes at least one polymorphic site of Table 1 

Another aspect of the invention is an oligonucleotide pair selected from the sequence 
of Figure 9 or its complement for amplification of a polymorphic site of Table 1 

Another aspect of the invention is an isolated nucleic acid molecule comprising about 
100 consecutive bases to about 235 kb substantially identical to the sequence of Figure 9, wherein the 
DNA molecule comprises at least one polymorphic site of Table 1 . 

Another aspect of the invention is a method to determine the presence or absence of 
the common hereditary hemochromatosis (HFE) gene mutation in an individual comprising: 

providing DNA or RNA from the individual; and 

assessing the DNA or RNA for the presence or absence of a haplotype of 

Table 1, 

wherein, as a result, the absence of a haplotype of Table 1 indicates the likely 
absence of the HFE gene mutation in the genome of the individual and the presence of the haplotype 
indicates the likely presence of the HFE gene mutation in the genome of the individual. 

Another aspect of the invention is a method to determine the presence or absence of 
the common hereditary hemochromatosis (HFE) gene mutation in an individual comprising: 

providing DNA or RNA from the individual; and 

assessing the DNA or RNA for the presence or absence of a genotype 
defined by a polymorphic allele of Table 1, 

wherein, as a result, the absence of a genotype defined by a polymorphic 
allele of Table 1 indicates the likely absence of the HFE gene mutation in the genome of the individual 
and the presence of the genotype indicates the likely presence of the HFE gene mutation in the 
genome of the individual. 

Another aspect of the invention is a culture of iymphoblastoid cells having the 
designation ATCC CRL-1 2371 . 

One aspect of the invention is an isolated nucleic acid sequence comprising a nucleic 
acid sequence substantially identical to BTF1 . 

A further aspect of the invention is an isolated nucleic acid sequence comprising a 
nucleic acid sequence substantially identical to BTF2. 

A further aspect of the invention is an isolated nucleic acid sequence comprising a 
nucleic acid sequence substantially identical to BTF3. 

A further aspect of the invention is an isolated nucleic acid sequence comprising a 
nucleic acid sequence substantially identical to BTF4. 

A further aspect of the invention is an isolated nucleic acid sequence comprising a 
nucleic acid sequence substantially identical to BTF5. 

A further aspect of the invention is an isolated nucleic acid sequence comprising a 
nucleic acid sequence substantially identical to NPT3. 
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A further aspect of the invention is an isolated nucleic acid sequence comprising a 
nucleic acid sequence substantially identical to NPT4. 

A further aspect of the invention is an isolated nucleic acid sequence comprising a 
nucleic acid sequence substantially identical to RoRet. 

Additional aspects of the invention include nucleic acid sequences that are cDNAs, 
polypeptides encoded by the nucleic acids of the invention and antibodies specially immunoreactive 
thereto, vectors comprising the nucleic acid sequences of the invention, and hor.t cells stably 
transfected with the nucleic acids of the invention. 

A further aspect of the invention is an isolated nucleic acid sequence comprising at 
least 18 contiguous nucleotides substantially identical to at least 18 contiguous nucleotides of BTF1 
A further aspect of the invention is an isolated nucleic acid sequence comprising at 
least 18 contiguous nucleotides substantially identical to at least 18 contiguous nucleotides of BTF2. 

A further aspect of the invention is an isolated nucleic acid sequence comprising at 
least 18 contiguous nucleotides substantially identical to at least 18 contiguous nucleotides of BTF3. 

A further aspect of the invention is an isolated nucleic acid sequence comprising at 
least 18 contiguous nucleotides substantially identical to at least 18 contiguous nucleotides of BTF4. 

A further aspect of the invention is an isolated nucleic acid sequence comprising at 
least 18 contiguous nucleotides substantially identical to at least 18 contiguous nucleotides of BTF5. 

A further aspect of the invention is an isolated nucleic acid sequence comprising at 
least 18 contiguous nucleotides substantially identical to at least 18 contiguous nucleotides of NPT3. 

A further aspect of the invention is an isolated nucleic acid sequence comprising at 
least 18 contiguous nucleotides substantially identical to at least 18 contiguous nucleotides of NPT4. 

A further aspect of the invention is an isolated nucleic acid sequence comprising at 
least 18 contiguous nucleotides substantially identical to at least 18 contiguous nucleotides of RoRet. 

BRIEF DESCRIPTION OF TH F DRAWINGS 
Figure 1 depicts a combination genetic, physical and transcription map of the HFE 
gene region. The first line shows the relative positions of selected genetic markers that define the HFE 
region. The heavy bar below represents the YAC clone used in the direct selection experiment. The 
order and positions of the bacterial clones employed in the exon-trapping and sample sequencing is 
indicated under the YAC. The thin bar under the bacterial clones represents the approximate locations 
of a subset of the expressed sequence fragments mapped to the contig. The thicker bars show the 
location of the cDNAs cloned. Two regions are bracketed; the butyrophilin family of genes (BTF), and 
the region where complete genomic sequencing was carried out. 

Figure 2 is a schematic of the 250 kb of genomic sequence including the HFE gene. 
Both the structure of the overall cDNA (top) and that corresponding to the coding regions (bottom), as 
well as the direction of transcription are shown. The positions of the histone genes, the zinc a-2 
glycoprotein pseudogene, and the ESTs are also shown. 

Figure 3 depicts an alignment of the predicted amino acid sequence of the BTF 
proteins. Sequences were aligned in a pair-wise fashion using CLUSTAL W (Thompson et at. Nucl. 
Acids Res - 22:4673-4680) to deduce the most parsimonious arrangement. The asterisks under the 
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alignment represent amino acids conserved in all 6 proteins; the "dots" represent conserved amino 
acids substitutions. Boxed are the regions within the proteins which correspond to three conserved 
motifs: 1) the B-G domain, 2) the transmembrane domain (TM), and 3) the B30-2 exon domain. 

Figure 4, panel (A) depicts a Northern blot analysis of representative members of the 
two groups of BTF proteins, BTF1 and BTF5. BTF1 hybridized to all tissues on the blot as a major 
transcript at 2.9 kb and a minor one at 5.0 kb. BTF5 hybridized to several transcripts ranging between 
4.0 and 3.1 kb and as a similar expression profile to BTF1 . Autoradiography was for 24 hours. The 
p-actin hybridization demonstrated the variation in ploy (A)+ RNA between the lanes. Autoradiography 
was for 1 hour. In panel (B), RT-PCR analysis demonstrated that the expression of both genes was 
widespread. Included in the (+) lane are cDNA 21 and 44 as positive controls; the (-) lane represents 
the no-DNA control. Amplification using primers for the RFP gene (Isomura etal. Nucleic Add R P * 
20:5305-5310 (1992)) controlled for the integrity of the cDNA. All first strand cDNAs were checked for 
contaminating genomic DNA amplification by carrying out an identical experiment excluding the 
reverse transcriptase. In all cases, no amplification was obtained (data not shown). 

Figure 5(A) depicts an alignment of the predicted amino acid sequence of the RoRet 
gene to the 52 kD Ro/SSA auto-antigen protein. The asterisks under the alignment represent 
conserved amino acids; the "dots" represent conserved amino acids substitutions. The putative DNA 
binding cysteine-rich domain and the B30-2 exon domain are boxed. Figure 5(B) depicts an alignment 
of the predicted amino acid sequence of the two novel putative sodium phosphate transport proteins to 
that of the NPT1 . 

Figure 6, panel (A) depicts a Northern blot analysis of the RoRet gene. The RoRet 
cDNA hybridized to 4 different transcripts, ranging from 7.1 kb to 2.2 kb. Autoradiography was 
performed for 4 days. The re-hybridization of the blot with a P-actin probe showed the variation in poly 
(A)+ RNA between the lanes. Autoradiography was for 1 hour. Panel (B) depicts RT-PCR analysis of 
the RoRet gene. Included in the (+) lane was a cDNA 27 positive control. Weak amplification of the 
correct size was observed in the small intestine, kidney and liver. The other tissues were negative as 
was the no DNA control lane (-). The RFP primers demonstrated the integrity of the cDNA. Panel (C) 
depicts Northern blot analysis of NPT3 and NPT4. NPT3 was expressed at high abundance in the 
heart and muscle as a single 7.2 kb transcript. Lesser amounts were found in the other tissues. The 
expression pattern of NPT4 was more restricted, being found only in the liver and kidney as a smear of 
transcripts ranging from 2.6 to 1 .7 kb. Panel (D) depicts RT-PCR analysis of the NPT3 and NPT4 
genes. Included in the (+) lane were the respective CDNA22E and 22B positive controls. The NPT3 
gene was expressed as the proper size PCR fragment in kidney, liver, spleen and testis. A smaller 
fragment was detected in all tissues with the exception of the liver. The no DNA control lane (-) was 
negative. NPT4 was expressed as the proper size fragment in the small intestine, kidney, liver and 
testis. Larger and smaller size fragments were found in all other tissues with the exception of the brain. 
For both genes these different size fragments may indicate alternative splice events. The no DNA 
control lane (-) was negative. The RFP primers demonstrated the integrity of the cDNA. 

Figure 7 depicts the sequences of cDNA 21 (BTF1), cDNA 29 (8TF3), cDNA 23 
(BTF4), cDNA 44 (BTF5). cDNA 32 (BTF2), cDNA 27 (RoRet). cDNA 22B (NPT3), CDNA22E (NPT4). 
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Figure 8 depicts the nucleotide sequence of approximately 235 kb in the HFE 
subregion from an unaffected individual. 

Figure 9 depicts the nucleotide sequence of approximately 235 kb in the HFE 
subregion from an HH affected individual. Polymorphic sites in the HH affected individual determined 
by comparing a sequence of the corresponding region from an HH unaffected individual are listed and 
described in Table I. 

DETAILED D ESCRIPTION 

A. Definitions 

Abbreviations for the twenty naturally occurring amino acids follow conventional 
usage. In the polypeptide notation used herein, the left-hand direction is the amino terminal direction 
and the right-hand direction is the carboxyl-terminal direction, in accordance with standard usage and 
convention. Similarly, unless specified otherwise, the left hand end of single-stranded polynucleotide 
sequences is the 5' end; the left hand direction of double-stranded polynucleotide sequences is 
referred to as the 5' direction. The direction of 5 1 to 3 1 addition of nascent RNA transcripts is referred to 
as the transcription direction; sequence regions on the DNA strand having the same sequence as the 
RNA and which are 5' to the 5' end of the RNA transcript are referred to as "upstream sequences"; 
sequence regions on the DNA strand having the same sequence as the RNA and which are 3' to the 3' 
end of the RNA transcript are referred to as "downstream sequences". 

The term "nucleic acids", as used herein, refers to either DNA or RNA. "Nucleic acid 
sequence" or "polynucleotide sequence" refers to a single- or double-stranded polymer of 
deoxyribonucleotide or ribonucleotide bases read from the 5" to the 3' end. It includes both self- 
replicating plasmids, infectious polymers of DNA or RNA and nonfunctional DNA or RNA. The 
complement of any nucleic acid sequence of the invention is understood to be included in the definition 
of that sequence. 

"Nucleic acid probes" may be DNA or RNA fragments. DNA fragments can be 
prepared, for example, by digesting plasmid DNA, or by use of PCR, or synthesized by either the 
phosphoramidite method described by Beaucage and Carruthers, Tetrahedron l gg 22:1859-1862 
(1981), or by the triester method according to Matteucci, etal., J. Am. Cham s r < 103:3185 (1981), 
both incorporated herein by reference. A double stranded fragment may then be obtained, if desired, 
by annealing the chemically synthesized single strands together under appropriate conditions or by 
synthesizing the complementary strand using DNA polymerase with an appropriate primer sequence. 
Where a specific sequence for a nucleic acid probe is given, it is understood that the complementary 
strand is also identified and included. The complementary strand will work equally well in situations 
where the target is a double-stranded nucleic acid. 

The phrase "selectively hybridizing to" refers to a nucleic acid probe that hybridizes, 
duplexes or binds only to a particular target DNA or RNA sequence when the target sequences are 
present in a preparation of total cellular DNA or RNA. "Complementary" or "target" nucleic acid 
sequences refer to those nucleic acid sequences which selectively hybridize to a nucleic acid probe. 
Proper annealing conditions depend, for example, upon a probe's length, base composition, and the 
number of mismatches and their position on the probe, and must often be determined empirically. For 
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discussions of nucleic acid probe design and annealing conditions, see. for example, Sambrook et at., 
Molecular Clpnjng; a Laboratory Manual (2nd ed.). Vols. 1-3, Cold Spring Harbor Laboratory. (1989) or 
Current Protocols in Molecular Biology , F. Ausubel era/., ed. Greene Publishing and Wiley- 
Interscience. New York (1987). 

The phrase "nucleic acid sequence encoding" refers to a nucleic acid which directs 
the expression of a specific protein or peptide. The nucleic acid sequences include both the DNA 
strand sequence that is transcribed into RNA and the RNA sequence that is translated into protein. 
The nucleic acid sequences include both the full length nucleic acid sequences as well as non-full 
length sequences derived from the full length protein. It being further understood that the sequence 
includes the degenerate codons of the native sequence or sequences which may be introduced to 
provide codon preference in a specific host cell. 

The phrase "isolated" or "substantially pure" refers to nucleic acid preparations that 
lack at least one protein or nucleic acid normally associated with the nucleic acid in a host cell. 

The phrase "expression cassette", refers to nucleotide sequences which are capable 
of affecting expression of a structural gene in hosts compatible with such sequences. Such cassettes 
include at least promoters and optionally, transcription termination signals. Additional factors 
necessary or helpful in effecting expression may also be used as described herein. 

The term "operably linked" as used herein refers to linkage of a promoter upstream 
from a DNA sequence such that the promoter mediates transcription of the DNA sequence. 

The term "vector", refers to viral expression systems, autonomous self-replicating 
circular DNA (plasmids), and includes both expression and nonexpression plasmids. Where a 
recombinant microorganism or cell culture is described as hosting an "expression vector," this includes 
both extrachromosomal circular DNA and DNA that has been incorporated into the host 
chromosome(s). Where a vector is being maintained by a host cell, the vector may either be stably 
replicated by the cells during mitosis as an autonomous structure, or is incorporated within the host s 
genome. 

The term "gene" as used herein is intended to refer to a nucleic acid sequence which 
encodes a polypeptide. This definition includes various sequence polymorphisms, mutations, and/or 
sequence variants wherein such alterations do not affect the function of the gene product. The term 
"gene" is intended to include not only coding sequences but also regulatory regions such as promoters, 
enhancers, and termination regions. The term further includes all introns and other DNA sequences 
spliced from the mRNA transcript, along with variants resulting from alternative splice sites. 

The term "plasmid" refers to an autonomous circular DNA molecule capable of 
replication in a cell, and includes both the expression and nonexpression types. Where a recombinant 
microorganism or cell culture is described as hosting an "expression plasmid", this includes both 
extrachromosomal circular DNA molecules and DNA that has been incorporated into the host 
chromosome(s). Where a plasmid is being maintained by a host cell, the plasmid is either being stably 
replicated by the cells during mitosis as an autonomous structure or is incorporated within the host s 
genome. 
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The phrase "recombinant protein" or "recombinantly produced protein" refers to a 
peptide or protein produced using non-native cells that do not have an endogenous copy of DNA able 
to express the protein. The cells produce the protein because they have been genetically altered by 
the introduction of the appropriate nucleic acid sequence. The recombinant pro .ein will not be found in 
association with proteins and other subcellular components normally associated with the cells 
producing the protein. The terms "protein" and "polypeptide" are used interchangeably herein. 

The following terms are used to describe the sequence relationships between two or 
more nucleic acids or polynucleotides: "reference sequence", "comparison window", "sequence 
identity", "percentage of sequence identity", and "substantial identity". A "reference sequence" is a 
defined sequence used as a basis for a sequence comparison; a reference sequence may be a subset 
of a larger sequence, for example, as* segment of a full-length cDNA or gene sequence given in a 
sequence listing, or may comprise a complete cDNA or gene sequence. 

Optimal alignment of sequences for aligning a comparison window may, for example, 
be conducted by the local homology algorithm of Smith and Waterman Adv. Annl Math , 2:482 (1981), 
by the homology alignment algorithm of Needleman and Wunsch J. Mol. Biol 48:443 (1 970), by the 
search for similarity method of Pearson and Lipman Proc. Natl. Acad. Sd.llSA 85:2444 (1988), or by 
computerized implementations of these algorithms (for example, GAP, BESTFIT, FASTA, and 
TFASTA in the Wisconsin Genetics Software Package Release 7.0, Genetics Computer Group, 575 
Science Dr.. Madison, Wl). 

The terms "substantial identity" or "substantial sequence identity" as applied to nucleic 
acid sequences and as used herein and denote a characteristic of a polynucleotide sequence, wherein 
the polynucleotide comprises a sequence that has at least 85 percent sequence identity, preferably at 
least 90 to 95 percent sequence identity, and more preferably at least 99 percent sequence identity as 
compared to a reference sequence over a comparison window of at least 20 nucleotide positions, 
frequently over a window of at least 25-50 nucleotides, wherein the percentage of sequence identity is 
calculated by comparing the reference sequence to the polynucleotide sequence which may include 
deletions or additions which total 20 percent or less of the reference sequence over the window of 
comparison. The reference sequence may be a subset of a larger sequence. 

As applied to polypeptides, the terms "substantial identity" or "substantial sequence 
Identity" mean that two peptide sequences, when optimally aligned, such as by the programs GAP or 
BESTFIT using default gap weights, share at least 80 percent sequence identity, preferably at least 90 
percent sequence identity, more preferably at least 95 percent sequence identity or more. 
"Percentage amino acid identity" or "percentage amino acid sequence identity" refers to a comparison 
of the amino acids of two polypeptides which, when optimally aligned, have approximately the 
designated percentage of the same amino acids. For example. "95% amino acid identity" refers to a 
comparison of the amino acids of two polypeptides which when optimally aligned have 95% amino acid 
identity. Preferably, residue positions which are not identical differ by conservative amino acid 
substitutions. For example, the substitution of amino acids having similar chemical properties such as 
charge or polarity are not likely to effect the properties of a protein. Examples include glutamine for 
asparagine or glutamic acid for aspartic acid. 
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The phrase "substantially purified" or "isolated" when referring to a peptide or protein, 
means a chemical composition which is essentially free of other cellular components. It is preferably in 
a homogeneous state although it can be in either a dry or aqueous solution. Purity and homogeneity 
are typically determined using analytical chemistry techniques such as polyacrylamide gel 
electrophoresis or high performance liquid chromatography. A protein which is the predominant 
species present in a preparation is substantially purified. Generally, a substantially purified or isolated 
protein will comprise more than 80% of all macromolecular species present in the preparation. 
Preferably, the protein is purified to represent greater than 90% of all macromolecular species present. 
More preferably the protein is purified to greater than 95%, and most preferably the protein is purified to 
essential homogeneity, wherein other macromolecular species are not detected by conventional 
techniques. 

The phrase "specifically binds to an antibody" or "specifically immunoreactive with", 
when referring to a protein or peptide, refers to a binding reaction which is determinative of the 
presence of the protein in the presence of a heterogeneous population of proteins and other biologies. 
Thus, under designated immunoassay conditions, the specified antibodies bind to a particular protein 
and do not bind in a significant amount to other proteins present in the sample. Specific binding to an 
antibody under such conditions may require an antibody that is selected for its specificity for a particular 
protein. A variety of immunoassay formats may be used to select antibodies specifically 
immunoreactive with a particular protein. For example, solid-phase ELISA immunoassays are routinely 
used to select monoclonal antibodies specifically immunoreactive with a protein. See Harlow and Lane 
(1988) Antibodies, a Laboratory Manual, Cold Spring Harbor Publications, New York, for a description 
of immunoassay formats and conditions that can be used to determine specific immunoreactivity. 

As used herein, "EST" or "Expressed Sequence Tag " refers to a partial DNA or cDNA 
sequence of about 150 to 500, more preferably about 300, sequential nucleotides of a longer 
sequence obtained from a genomic or cDNA library prepared from a selected cell, cell type, tissue or 
tissue type, or organisms which longer sequence corresponds to an mRNA or a gene found in that 
library. An EST is generally DNA. One or more libraries made from a single tissue type typically 
provide at least 3000 different (i.e. unique) ESTs and potentially the full complement of all possible 
EST's representing all possible cDNAs. e.g.. 50,000 - 100.000 in an animal such as a human. (See. 
for example, Adams etal. Science 252:1651-1656 (1991)). 

"Stringent" as used herein refers to hybridization and wash conditions of 50% formamide at 
42° C. Other stringent hybridization conditions may also be selected. Generally, stringent conditions 
are selected to be about 5° C lower than the thermal melting point (Tm) for the specific sequence at a 
defined ionic strength and pH. The Tm is the temperature (under defined ionic strength and pH) at 
which 50% of the target sequence hybridizes to a perfectly matched probe. Typically, stringent 
conditions will be those in which the salt concentration is at least about 0.02 molar at pH 7 and the 
temperature is at least about 60°C. As other factors may significantly affect the stringency of 
hybridization, including, among others, base composition and size of the complementary strands, the 
presence of organic solvents and the extent of base mismatching, the combination of parameters is 
more important than the absolute measure of any one. 
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B. Transcrip t Map and New Genes near HH 

The instant invention provides a fine structure map of the 1 megabase region 
surrounding the HFE gene. As part of that map the instant invention provides approximately 250 kb of 
DNA sequence of which about 235 kb are provided in Figure 8 and eight loci of particular interest 
corresponding to candidate genes within the 1 megabase region. These loci are useful as genetic and 
physical markers for further mapping studies. Additionally, the eight cDNA sequences corresponding 
to those loci are useful, for example, for the isolation of other genes in putative gene families, the 
identification of homologs from other species, and as probes for diagnostic assays. In particular, 
isolated nucleic acid sequences of at least 18 nucleotides substantially identical to contiguous 
nucleotides of a cDNA of the invention are useful as PCR primers. Typically, the PCR primer will be 
used as part of a pair of primers in a PCR reaction. Isolated nucleic acid sequences preferably 
comprising about 18-100 nucleotides, more preferably at least 18 nucleotides, substantially identical to 
contiguous nucleotides in a cDNA of the invention are useful in the design of PCR primers and probes 
for hybridization assays. Additionally, the proteins encoded by those cDNAs are useful in the 
generation of antibodies for analysis of gene expression and in diagnostic assays, and in the 
purification of related proteins. 

Thus, in one embodiment of the invention, a 235 kb sequence is provided for the HFE 
subregion within the 1 megabase region mapped. This sequence can serve as a reference in genetic 
or physical analysis of deletions, substitutions, and insertions in that region. Additionally, the sequence 
information provides a resource for the further identification of new genes in that region. Thus, nucleic 
acid sequences substantially identically to the 235 kb sequence are also included in the scope of this 
invention. 

In a further embodiment of the invention, a family of five genes, BTF1-5, is provided 
which are related by sequence homology to the milk protein butyrophilin (BT) (Figures 1 , 3, and 7). 
The predicted amino acid sequences of the proteins encoded by these genes are provided in Figure 3. 
These cDNAs are useful for the identification of further members of the BT family and to study 
regulation of expression of this family of genes. The proteins encoded by these cDNAs can be useful 
in the identification and isolation of ligands for the BT protein, and in the generation of agonists or 
antagonists of BT function. Nucleic acid sequences substantially identically to BTF1-5 and the proteins 
encoded by them are also included in the scope of this invention, including allelic forms. 

In a further embodiment of the invention, a novel gene RoRet is provided, which is 
related by sequence homology to the 52 kD Ro/SSA Lupus and Sjogren's syndrome autoantigen. This 
sequence is especially useful in the identification of other genes that may be involved in Lupus or 
Sjorgen's syndrome. The protein encoded by this cDNA can be useful in the identification and isolation 
of ligands for the autoantigen, and in the generation of agonists or antagonists of the antigen. Nucleic 
acid sequences substantially identically to RoRet and the proteins encoded by them are also included 
in the scope of this invention. 

In a further embodiment of the invention, two genes, NPT3 and NPT4, with structural 
homology to a type 1 sodium transport gene are provided. These cDNAs and the proteins expressed 
by them are useful in determining the etiology of hypophosphatemia, along with being useful as probes 
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in the identification and isolation of further members of the gene family. Nucleic acid sequences 
substantially identically to the NPT1-like sequences and the proteins encoded by them are also 
included in the scope of this invention. 
C. Polymorphic Markers 

The invention provides 397 new polymorphic sites in the region of the HFE gene. 
These polymorphisms are listed in Table 1 . As described below, these polymorphisms were identified 
by comparison of the DNA sequence of an affected individual homozygous for the common ancestral 
HH mutation with that of an unaffected individual disclosed in copending U.S. 08/724,394. 
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♦ D6S2238 occurs at base 1. 24dl occurs at base 41316. D6S2239 occurs at base 84841. D6S2241 occurs at base 235032 



45 Table 2. Polymorphic Allele Frequencies 
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35 These polymorphisms provide surrogate markers for use in diagnostic assays to 

detect the likely presence of the mutations 24d1 and/or 24d2, in preferably 24d1 , in homozygotes or 
heterozygotes. Thus, for example, DNA or RNA from an individual is assessed for the presence or 
absence of a genotype defined by a polymorphic allele of Table 1 , wherein, as a result, the absence of 
a genotype defined by a polymorphic allele of Table 1 indicates the likely absence of the HFE gene 

40 mutation in the genome of the individual and the presence of the genotype indicates the likely presence 
of the HFE gene mutation in the genome of the individual. 

These markers may be used singly, in combination with each other, or with other 
polymorphic markers (such as those disclosed in co-pending PCT application WO 96/06583) in 
diagnostic assays for the likely presence of the HFE gene mutation in an individual. For example, any 

45 of the markers defined by the polymorphic sites of Table 1 can be used in diagnostic assays in 

combination with 24d1 or 24d2, or at least one of polymorphisms HHP-1 , HHP-1 9, or HHP-29, or 
microsatellite repeat alleles 19D9:205; 18B4:235; 1A2:239; 1E4:271; 24E2:245; 2B8:206; 3321-1:98; 
4073-1:182; 4440-1:180; 4440-2:139; 731-1:177; 5091-1:148; 3216-1 :221 ; 4072-2:170; 950-1:142; 
950-2:164; 950-3:165; 950-4:128; 950-6:151 ; 950-8:137; 63-1:151; 63-2:113; 63-3:169; 65-1:206; 65- 
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2:159; 68-1:167; 241-5:108; 241-29:113; 373-8:151 ; and 373-29:113, D6S258:199, D6S265:122. 
D6S105:124; D6S306:238; D6S464:206; and D6S1001:180. 

Table 2 lists the frequency of about 100 of the alleles defined by the polymorphic sites 
of the invention in the general population. As is evident from the Table, certain of these alleles are 
present rarely in the general population. These polymorphisms are thus preferred as surrogate 
markers in diagnostic assays for the presence of a mutant HFE allele ("gene mutation' 1 ) such as 24d1 
or 24d2. Preferably, the frequency of the polymorphic allele used in the diagnostic assay in the 
general population is less than about 50%, more preferably less than about 25%, and most preferably 
less than about 5%. Thus, of the genotypes defined by the alleles listed in Table 2, polymorphisms 
occurring at base 35983 and base 61465 of Figure 1 are preferred. 

It will be understood by those of skill in the art that because they were identified in an 
ancestral HH homozygote, the haplotypes defined by the polymorphic sites of Table 1 are predictive of 
the likely presence of the HFE gene mutation 24d1 . Thus, for example, the likelihood of any affected 
individual having at least two or more of any of the polymorphic alleles defined by Table 1 is greater 
than that for any unaffected individual. Similarly, the likelihood of any affected individual having at least 
three or more of ar^ of the polymorphic alleles defined by Table 1 is greater than that for any 
unaffected individual. 

Thus, for example, in a diagnostic assay for the likely presence of the HFE gene 
mutation in the genome of the individual, DNA or RNA from the individual is assessed for the presence 
or absence of a haplotype of Table 1, wherein, as a result, the absence of a haplotype of Table 1 
indicates the likely absence of the HFE gene mutation in the genome of the individual and the 
presence of the haplotype indicates the likely presence of the HFE gene mutation in the genome of the 
individual. 

The markers defined by the polymorphic sites of Table 1 are additionally useful as 
markers for genetic analysis of the inheritance of certain HFE alleles and other genes which occur 
within the chromosomal region corresponding to the sequence of Figure 9 which include, for example, 
those disclosed in copending U. S.S.N. 08/724,394. 

As the entire nucleotide sequence of the region is provided in Figure 9, it will be 
evident to those of ordinary skill in the art which sequences to use as primers or probes for detecting 
each polymorphism of interest. Thus, in some embodiments of the invention, the nucleotide 
sequences of the invention include at least one oligonucleotide pair selected from the sequence of 
Figure 9 or its complement for amplification of a polymorphic site of Table 1 . Furthermore, in some 
embodiments of the invention a preferred hybridization probe is an oligonucleotide comprising at least 
8 to about 1 00 consecutive bases from the sequence of Figure 9, or the complement of the sequence, 
wherein the at least 8 to about 1 00 consecutive bases includes at least one polymorphic site of Table 
1 . In some embodiments the polymorphic site is at base 35983 or base 61465. 

It will also be appreciated that the nucleic acid sequences of the invention include 
isolated nucleic acid molecules comprising about 100 consecutive bases to about 235 kb substantially 
identical to the sequence of Figure 9, wherein the DNA molecule comprises at least one polymorphic 
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site of Table 1 . Such isolated DNA sequences are useful as primers, probes, or as the component of a 
kit in diagnostic assays for detecting the likely presence of the HFE gene mutation in an individual. 
D. Nucleic A cid Based Screening 

Individuals carrying polymorphic alleles of the invention may be detected at either the 
DNA, the RNA, or the protein level using a variety of techniques that are well known in the art. The 
genomic DNA used for the diagnosis may be obtained from body cells, such as those present in 
peripheral blood, urine, saliva, bucca, surgical specimen, and autopsy specimens. The DNA may be 
used directly or may be amplified enzymatically in vitro through use of PCR (Saiki et at. Science 
239:487-491 (1988)) or other in vitro amplification methods such as the ligase chain reaction (LCR) 
(Wu and Wallace Genomics 4:560-569 (1989)), strand displacement amplification (SDA) (Walker et al. 
Proc. Natl. Acad. Sci, U,S.A. 89:392-396 (1992)), self-sustained sequence replication (3SR) (Fahy et 
al. PCR Methods Appl 1:25-33 (1992)), prior to mutation analysis. The methodology for preparing 
nucleic acids in a form that is suitable for mutation detection is well known in the art. 

The detection of polymorphisms in specific DNA sequences, such as in the region of 
the HFE gene, can be accomplished by a variety of methods including, but not limited to, restriction- 
fragment-length-polymorphism detection based on allele-specific restriction-endonuclease cleavage 
(Kan and Dozy Lancet ii:91 0-91 2 (1978)), hybridization with allele-specific oligonucleotide probes 
(Wallace et al. Nucl Acids Re? 6:3543-3557 (1978)), including immobilized oligonucleotides (Saiki et 
aL Proc. Natl. Acacj, get. U.g.A. 86:6230-6234 (1989)) or oligonucleotide arrays (Maskos and Southern 
Nucl Acids Res 21 :2269-2270 (1 993)), allele-specific PCR (Newton et al. Nucl Acids Res 1 7:2503- 
2516 (1989)), mismatch-repair detection (MRD) (Faham and Cox Genome Res 5:474-4fl? (1995)), 
binding of MutS protein (Wagner et al. Nucl Acids Res 23:3944-3948 (1995), denaturing-gradient gel 
electrophoresis (DGGE) (Fisher and Lerman et al. Proc. Natl. Acad. ScL USA 80:1579-1583 (1983)), 
single-strand-conformation-polymorphism detection (Orita et al. Genomics 5:874-879 (1983)), RNAase 
cleavage at mismatched base-pairs (Myers et al. Science 230:1242 (1985)), chemical (Cotton et al. 
Prpc. Natl, Acacj, Sci. U.S.A, 85:4397-4401 (1988)) or enzymatic (Youil et aL Proc. Natl. Acad Sri 
LLSA 92:87-91 (1995)) cleavage of heteroduplex DNA, methods based on allele specific primer 
extension (SyvSnen et al. Genomics 8:684-692 (1990)), genetic bit analysis (GBA) (Nikiforov et al. Nucl 
Acids Res 22:4167-4175 (1994)), the oligonucleotide-ligation assay (OLA) (Landegren et aL Science 
241:1077 (1988)), the allele-specific ligation chain reaction (LCR) (Barranv Proc. Natl. Acad Sri 

88:189-193 (1991)), gap-LCR (Abravaya et aL Nucl Acids Res 23:675-682 (1995)), radioactive 
and/or fluorescent DNA sequencing using standard procedures well known in the art, and peptide 
nucleic acid (PNA) assays (Orum et aL, Nucl. Acids Res. 21 :5332-5356 (1993); Thiede et aL, Nucl. 
Acids Res. 24:983-984 (1 996)). 

In addition to the genotypes defined by the polymorphisms of the invention, as 
described in co-pending PCT application WO 96/35802 published November 14, 1996, genotypes 
characterized by the presence of the alleles 19D9:205; 18B4:235; 1A2:239; 1E4:271; 24E2:245; 
2B8:206; 3321-1:98 (denoted 3321-1:197 therein); 4073-1:182; 4440-1:180; 4440-2:139; 731-1:177; 
5091-1:148; 3216-1 :221 ; 4072-2:170 (denoted 4072-2:148 therein); 950-1:142; 950-2:164; 950-3:165; 
950-4:128; 950-6:151; 950-8:137; 63-1:151; 63-2:113; 63-3:169; 65-1:206; 65-2:159; 68-1:167; 241- 
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5:108; 241-29:113; 373-8:151 ; and 373-29:113, alleles D6S258:199, D6S265:122, D6S105:124, 
D6S306:238, D6S464:206; and D6S1001 :180, and/or alleles associates with the HHP-1 . the HHP-19 
or HHP-29 single base-pair polymorphisms can also be used to assist in the identification of an 
individual whose genome contains 24d1 and/or 24d2. For example, the assessing step can be 
performed by a process which comprises subjecting the DNA or RNA to amplification using 
oligonucleotide primers flanking a polymorphism of Table 1 , and oligonucleotides flanking 24d1 and/or 
24d2, oligonucleotide primers flanking at least one of the base-pair polymorphisms HHP-1 HHP-19 
and HHP-29, oligonucleotide primers flanking at least one of the microsatellite repeat alleles, or 
oligonucleotide primers for any combination of polymorphisms or microsatellite repeat alleles thereof. 

Oligonucleotides useful in diagnostic assays are typically at least 8 consecutive 
nucleotides in length, and may range upwards of 18 nucleotides in length to greater than 100 or more 
consecutive nucleotides. Such oligonucleotides can be derived from either the genomic DNA of Figure 
8 or 9, or cDNA sequences derived therefrom, or may be synthesized. 

Additionally, the proteins encoded by such cDNAs are useful in the generation of 
antibodies for analysis of gene expression and in diagnostic assays, and in the purification of related 
proteins. 

E. General Methods 

The nucleic acid compositions of this invention, whether RNA, cDNA, genomic DNA, 
or a hybrid of the various combinations, may be isolated from natural sources, including cloned DNA, 
or may be synthesized in vitro. The nucleic acids claimed may be present in transformed or 
transfected whole cells, in a transformed or transfected cell lysate, or in a partially purified or 
substantially pure form. 

Techniques for nucleic acid manipulation of the 
nucleic acid sequences of the invention such as subcloning nucleic acid sequences encoding 
polypeptides into expression vectors, labeling probes, DNA hybridization, and the like are described 
generally in Sambrook etal.. Molecular Cloning - a Labor atory Manual (2nd Ed.). Vol. 1-3. Cold Spring 
Harbor Laboratory. Cold Spring Harbor. New York, (1989), which is incorporated herein by reference. 
This manual is hereinafter referred to as "Sambrook etal." 

There are various methods of isolating the nucleic acid sequences of the invention. 
For example, DNA is isolated from a genomic or cDNA library using labeled oligonucleotide probes 
having sequences complementary to the sequences disclosed herein. Such probes can be used 
directly in hybridization assays. Alternatively probes can be designed for use in amplification 
techniques such as PCR. 

To prepare a cDNA library, mRNA is isolated from tissue such as heart or pancreas, 
preferably a tissue wherein expression of the gene or gene family is likely to occur. cDNA is prepared 
from the mRNA and ligated into a recombinant vector. The vector is transfected into a recombinant 
host for propagation, screening and cloning. Methods for making and screening cDNA libraries are 
well known. See Gubler. U. and Hoffman, B.J. Gene 25:263-269 (1983) and Sambrook etal. 

For a genomic library, for example, the DNA is extracted from tissue and either 
mechanically sheared or enzymatically digested to yield fragments of about 12-20 kb. The fragments 
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are then separated by gradient centrifugation from undesired sizes and are constructed in 
bacteriophage lambda vectors. These vectors and phage are packaged in vitro, as described in 
Sambrook. etal. Recombinant phage are analyzed by plaque hybridization as described in Benton 
and Davis. Science 196:180-182 (1977). Colony hybridization is carried out as generally described in 
M. Grunstein etal. Proc. Natl. Acad. Sci. USA 72:3961-3965 (1975). 

DNA of interest is identified in either cDNA or genomic libraries by its ability to hybridize 
with nucleic acid probes, for example on Southern blots, and these DNA regions are isolated by 
standard methods familiar to those of skill in the art. See Sambrook, et al. 

In PCR techniques, oligonucleotide primers complementary to the two 3' borders of 
the DNA region to be amplified are synthesized. The polymerase chain reaction is then carried out 
using the two primers. See PCR Protocols: a Guide to Methods and Annli^tinn C ( | nnis> M> Gelfandi 
D.. Sninsky, J. and White, T., eds.), Academic Press, San Diego (1990). Primers can be selected to 
amplify the entire regions encoding a full-length sequence of interest or to amplify smaller DNA ' 
segments as desired. 

PCR can be used in a variety of protocols to isolate cDNA's encoding a sequence of 
interest. In these protocols, appropriate primers and probes for amplifying DNA encoding a sequence 
of interest are generated from analysis of the DNA sequences listed herein. Once such regions are 
PCR-amplified. they can be sequenced and oligonucleotide probes can be prepared from sequence 
obtained. 

Oligonucleotides for use as primers or probes are chemically synthesized according to 
the solid phase phosphoramidite triester method first described by Beaucage, S.L. and Carruthers, 
M.H., Tetrahedron Lett, . 22(20):1859-1862 (1981) using an automated synthesizer, as described in 
Needham-VanDevanter, D.R., etal.. Nucleic Acids Rp* 12:6159-6168 (1984). Purification of 
oligonucleotides is by either native acrylamide gel electrophoresis or by anion-exchange HPLC as 
described in Pearson, J.D. and Regnier, F.E., J, Chrpm,, 255:137-149 (1983). The sequence of the 
synthetic oligonucleotide can be verified using the chemical degradation method of Maxam, A.M. and 
Gilbert. W., in Grossman, L. and Moldave, D., eds. Academic Press. New York, Methods in 
EnzyjUfilfigy 65:499-560 (1980). 
1. Expression 

Once DNA encoding a sequence of interest is isolated and cloned, one can express 
the encoded proteins in a variety of recombinantly engineered cells. It is expected that those of skill in 
the art are knowledgeable in the numerous expression systems available for expression of DNA 
encoding a sequence of interest. No attempt to describe in detail the various methods known for the 
expression of proteins in prokaryotes or eukaryotes is made here. 

In brief summary, the expression of natural or synthetic nucleic acids encoding a 
sequence of interest will typically be achieved by operably linking the DNA or cDNA to a promoter 
(which is either constitutive or inducible), followed by incorporation into an expression vector. The 
vectors can be suitable for replication and integration in either prokaryotes or eukaryotes. Typical 
expression vectors contain transcription and translation terminators, initiation sequences, and 
promoters useful for regulation of the expression of polynucleotide sequence of interest. To obtain 
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high level expression of a cloned gene, it is desirable to construct expression plasmids which contain, 
at the minimum, a strong promoter to direct transcription, a ribosome binding site for translational 
initiation, and a transcription/translation terminator. The expression vectors may also comprise generic 
expression cassettes containing at least one independent terminator sequence, sequences permitting 
5 replication of the plasmid in both eukaryotes and prokaryotes, i.e., shuttle vectors, and selection 

markers for both prokaryotic and eukaryotic systems. See Sambrook et a/. Examples of expression of 
ATP-sensitive potassium channel proteins in both prokaryotic and eukaryotic systems are described 
below. 

a- Expression in Prokarvotes 

1 0 A variety of procaryotic expression systems may be used to express the proteins of the 

invention. Examples include E. co//, Bacillus, Streptomyces, and the like. 

It is preferred to construct expression plasmids which contain, at the minimum, a 
strong promoter to direct transcription, a ribosome binding site for translational initiation, and a 
transcription/translation terminator. Examples of regulatory regions suitable for this purpose in E. coli 

1 5 are the promoter and operator region of the E. coli tryptophan biosynthetic pathway as described by 
Yanofsky, C. t J. Bacteriol. 158:1018-1024 (1984) and the leftward promoter of phage lambda (PA) as 
described by Herskowitz, I. and Hagen, D., Ann. Rev. Genet 14:399-445 (1980). The inclusion of 
selection markers in DNA vectors transformed in E coli is also useful. Examples of such markers 
include genes specifying resistance to ampicillin, tetracycline, or chloramphenicol. See Sambrook et 

20 a/, for details concerning selection markers for use in E. coli. 

To enhance proper folding of the expressed recombinant protein, during purification 
from E. co//, the expressed protein may first be denatured and then renatured. This can be 
accomplished by solubilizing the bacterially produced proteins in a chaotropic agent such as guanidine 
HCI and reducing all the cysteine residues with a reducing agent such as beta-mercaptoethanol. The 

25 protein is then renatured, either by slow dialysis or by gel filtration. See U.S. Patent No. 4,51 1 ,503. 

Detection of the expressed antigen is achieved by methods known in the art as 
radioimmunoassay, or Western blotting techniques or immunoprecipitation. Purification from E. coli 
can be achieved following procedures such as those described in U.S. Patent No. 4,51 1 ,503. 
b. Expression in Eukarvotes 

30 A variety of eukaryotic expression systems such as yeast, insect cell lines, bird, fish, 

and mammalian cells, are known to those of skill in the art. As explained briefly below, a sequence of 
interest may be expressed in these eukaryotic systems. 

Synthesis of heterologous proteins in yeast is well known. Methods in Yeast Genetics , 
Sherman, F., et a/., Cold Spring Harbor Laboratory, (1982) is a well recognized work describing the 

35 various methods available to produce the protein in yeast. 

Suitable vectors usually have expression control sequences, such as promoters, 
including 3-phosphoglycerate kinase or other glycolytic enzymes, and an origin of replication, 
termination sequences and the like as desired. For instance, suitable vectors are described in the 
literature (Botstein, e/e/., Gene 8:17-24 (1979); Broach, eta!.. Gene 8:121-133 (1979)). 
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Two procedures are used in transforming yeast cells. In one case, yeast cells are first 
converted into protoplasts using zymolyase, lyticase or glusulase, followed by addition of DNA and 
polyethylene glycol (PEG). The PEG-treated protoplasts are then regenerated in a 3% agar medium 
under selective conditions. Details of this procedure are given in the papers by J.D. Beggs, Nature 
5 (London) 275:104-109 (1978); and Hinnen, a., etai.. Proc. Natl. Acad Sci. U.S.A. 75:19S>Q-1Q33 

(1978). The second procedure does not involve removal of the cell wall. Instead the cells are treated 
with lithium chloride or acetate and PEG and put on selective plates (Ito, H., etai, ^LfiacL 153:163-168 
(1983)). 

The proteins of the invention, once expressed, can be isolated from yeast by lysing the 
cells and applying standard protein isolation techniques to the lysates. The monitoring of the 
purification process can be.accomplished by using Western blot techniques or radioimmunoassay or 
other standard immunoassay techniques. 

The sequences encoding the proteins of the invention can also be ligated to various 
expression vectors for use in transforming cell cultures of, for instance, mammalian, insect, bird or fish 
origin. Illustrative of cell cultures useful for the production of the polypeptides are mammalian cells. 
Mammalian cell systems often will be in the form of monolayers of cells although mammalian cell 
suspensions may also be used. A number of suitable host cell lines capable of expressing intact 
proteins have been developed in the art, and include the HEK293, BHK21, and CHO cell lines, and 
various human cells such as COS cell lines, HeLa cells, myeloma cell lines, Jurkat cells, etc. 
Expression vectors for these cells can include expression control sequences, such as an origin of 
replication, a promoter (e.g., the CMV promoter, a HSV tk promoter or pgk (phosphoglycerate kinase) 
promoter), an enhancer (Queen etai Immunol. Rev. 89:49 (1986)), and necessary processing 
information sites, such as ribosome binding sites, RNA splice sites, polyadenylation sites (e.g., an SV40 
large T Ag poly A addition site), and transcriptional terminator sequences. Other animal cells useful for 
production of ATP-sensitive potassium channel proteins are available, for instance, from the American 
Type Culture Collection Catalogue of Cell Lines and Hybridomas (7th edition, (1992)). 

Appropriate vectors for expressing the proteins of the invention in insect cells are 
usually derived from the SF9 baculovirus. Suitable insect cell lines include mosquito larvae, silkworm, 
armyworm, moth and Drosophila cell lines such as a Schneider cell line (See Schneider J. Embryol. 
Exp. MorphoL 27:353-355 (1987). 

As indicated above, the vector, e.g., a plasmid, which is used to transform the host 
cell, preferably contains DNA sequences to initiate transcription and sequences to control the 
translation of the protein. These sequences are referred to as expression control sequences. 

As with yeast, when higher animal host cells are employed, polyadenylation or 
transcription terminator sequences from known mammalian genes need to be incorporated into the 
vector. An example of a terminator sequence is the polyadenylation sequence from the bovine growth 
hormone gene. Sequences for accurate splicing of the transcript may also be included. An example 
of a splicing sequence is the VP1 intron from SV40 (Sprague, J. et af. t J. Virol. 45: 773-781 (1 983)). 

Additionally, gene sequences to control replication in the host cell may be 
incorporated into the vector such as those found in bovine papilloma virus type-vectors. 
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Saveria-Campo, M., 1985, "Bovine Papilloma virus DNA a Eukaryotic Cloning Vector" in DNA Cloning 
Vol II a Practical Approach Ed. D.M. Glover, IRL Press, Arlington, Virginia pp. 213-238. 

The host cells are competent or rendered competent for transformation by various 
means. There are several well-known methods of introducing DNA into animal cells. These include: 
5 calcium phosphate precipitation, fusion of the recipient cells with bacterial protoplasts containing the 
DNA, treatment of the recipient cells with liposomes containing the DNA, DEAE dextran, 
electroporation and micro-injection of the DNA directly into the cells. 

The transformed cells are cultured by means well known in the art (Biochemical 
Methods in Cell Culture and Virology . Kuchler, R.J., Dowden, Hutchinson and Ross. Inc., (1977)). The 
10 expressed polypeptides are isolated from cells grown as suspensions or as monolayers. The latter are 
recovered by well known mechanical, chemical or enzymatic means. 

2. Purification 

The proteins produced by recombinant DNA technology may be purified by standard 
techniques well known to those of skill in the art. Recombinantly produced proteins can be directly 

1 5 expressed or expressed as a fusion protein. The protein is then purified by a combination of cell lysis 
(e.g., sonication) and affinity chromatography. For fusion products, subsequent digestion of the fusion 
protein with an appropriate proteolytic enzyme releases the desired polypeptide. 

The polypeptides of this invention may be purified to substantial purity by standard 
techniques well known in the art, including selective precipitation with such substances as ammonium 

20 sulfate, column chromatography, immunopurification methods, and others. See, for instance, R. 

Scopes, Protein Purification: Principles and Practice . Springer-Verlag: New York (1982), incorporated 
herein by reference. For example, in an embodiment, antibodies may be raised to the proteins of the 
invention as described herein. Cell membranes are isolated from a cell line expressing the 
recombinant protein, the protein is extracted from the membranes and immunoprecipitated. The 

25 proteins may then be further purified by standard protein chemistry techniques as described above. 

3. Antibodies 

As mentioned above, antibodies can also be used for the screening of polypeptide 
products encoded by the polymorphic nucleic acids of the invention. In addition, antibodies are useful 
in a variety of other contexts in accordance with the present invention. Such antibodies can be utilized 
30 for the diagnosis of HH and, in certain applications, targeting of affected tissues. 

Thus, in accordance with another aspect of the present invention a kit is provided that 
is suitable for use in screening and assaying for the presence of polypeptide products encoded by the 
polymorphic nucleic acids of the invention by an immunoassay through use of an antibody which 
specifically binds to polypeptide products encoded by the polymorphic nucleic acids of the invention in 
35 combination with a reagent for detecting the binding of the antibody to the gene product. 

Once hybridoma cell lines are prepared, monoclonal antibodies can be made through 
conventional techniques of priming mice with pristane and interperitoneally injecting such mice with the 
hybrid cells to enable harvesting of the monoclonal antibodies from ascites fluid. 

In connection with synthetic and semi-synthetic antibodies, such terms are intended to 
40 cover antibody fragments, isotype switched antibodies, humanized antibodies (mouse-human, human- 
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mouse, and the like), hybrids, antibodies having plural specificities, fully synthetic antibody-like 
molecules, and the like. 

This invention also embraces diagnostic kits for detecting DNA or RNA comprising a 
polymorphism of Table 1 in tissue or blood samples which comprise nucleic acic, probes as described 
herein and instructional material. The kit may also contain additional components such as labeled 
compounds, as described herein, for identification of duplexed nucleic acids. 

The following examples are provided to illustrate the invention but not to limit its scope. 
Other variants of the invention will be readily apparent to one of ordinary skill in tie art and are 
encompassed by the appended claims. 
F. EXPERIMENTAL EXAMPLES 

1. Meaabase transcript map 

In these studies direct selection, exon-trapping, and genomic sample sequencing were 
used to generate a transcript map of a 1 megabase region approximately 8.5 megabases telomeric to 
HLA-A in the vicinity of HFE. This region 6p21 .3 was flanked by the genetic markers D6S2242 and 
D6S2241 . The starting material for these experiments was a 1 megabase YAC labeled y899g1 and a 
bacterial clone contig of this region (Feder et al. Nature Genetics 1 3:399-408 (1996)). These 
techniques and other methods used in the study are outlined below. 

a. Direct Selection (PS) 

Poly A* RNA from human fetal brain, liver and small intestine (Clontech, Palo Alto, 
CA) were converted into cDNA using random primers and a Superscript cDNA synthesis kit (Life 
Technologies, Gaithersburg, MD). The cDNA was digested with Mbo I and ligated to cDNA Mbo I 
linker-adaptors. Unligated linker-adaptor were removed by passage through cDNA spun columns 
(Pharmacia, Piscataway, NJ). The 5 ng of each of the ligated cDNAs were amplified using the cDNA 
Mbo l-S primer (S'-CCTGATGCTCGAGTGAATTC-S'). The amplified products were purified on S-400 
spin columns (Pharmacia, Piscataway, NJ), ethanol precipitated and resuspended at tmg/ml in TE. 
Gel-purified yac899g1 (Centre d'Etude du Polymorphisme Humain) was processed as described by 
Morgan etai (Nucl. Acids Res. 20:5173-5179 (1992)). The cDNAs were mixed in equal molar 
amounts for a total of 3 mg, and blocked with a mixture of 4 mg Cot-1 DNA (Life Technologies, 
Gaithersburg, MD), and a cocktail of Sau 3A-digested ribosomal and five different histone DNAs. The 
blocked cDNAs were hybridized to biotinylated yac899g1 DNA and streptavidin capture was carried out 
as described by Morgan era/, (ibid). After the second round of selection, the eluted cDNAs were 
amplified using the cDNA Mbo l-S primer which included a (CUA)4 repeat at the 5' end to facilitate 
cloning into a version of pSP72 (Promega, Madison, Wl) constructed for use with uracil-DNA 
glycolyase cloning (UDG, Life Technologies, Gaithersburg, MD). Recombinants were transformed in 
DH5cc f 1000 clones picked into a 96 well format, and clones prepped for DNA sequencing using AGTC 
boiling 96-well mini-prep system (Advance Genetic Technologies, Gaitherburg, MD). 

Four hundred and sixty five clones were sequenced and the resulting data searched 
by BLAST (Altschul etai J. MoL Biol, 215:403-410 (1990)). Those clones representing repetitive, 
bacterial, yeast, mitochondria! and histone sequences were eliminated from future considerations. The 
remaining sequences were then searched for overlaps and assembled into 108 unique DS contigs. 
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The number of clones per DS contig varied between 1 to 22 with the length of each contig ranging 
from 250bp to 850 bp. Small sequence-tag-sites PCR assays were developed for each DS contig and 
two experiments were carried out concomitantly; mapping each DS contig back to the bacterial clone 
contig of the region and testing for the presence of each DS contig in cDNA libraries. Overall, 86 or 
80% of the DS contigs mapped back to the region and were found to be in cDNA libraries. The number 
of 80% mapping to the region was probably an underestimate of the fidelity of the direct-selection since 
PCR assays which cross exon-intron boundaries would be expected to fail or give larger size products, 
thereby being scored negative, 
b. Exon-Trappinq 

CsCI-purified genomic P1 (Genome Systems), BAC (Research Genetics) and PAC 
(Genome Systems) DNAs were digested with BamHI, Bgl II, Pst I Sac 1 and Xho I and 125 ng of each 
digest ligated into 500 ng pSPL3 (Church eta/. Nature Genetics 6:98-105 (1994)) (Life Technologies, 
Gaithersburg, MD) digested with the appropriate restriction enzyme and phosphatased with calf 
intestinal alkaline phosphatase (USB, Cleveland, OH). One tenth of the ligation was used to transform 
XL1-Blue MRF' cells (Stratagene, La Jolla, CA) by electroporation. Nine tenths of the electroporation 
was used to inoculate 1 0 ml of LB + 1 0Opg/ml of carbenicillen and after overnight growth, DNA was 
prepared using Qiagen Q-20 tips (Qiagen GmbH, Hilden Germany). The remaining one tenth was 
plated on LB +100 pg/ml carbenicillen plates to evaluated the efficiency on cloning and to test 
individual clones for the present of single inserts. COS-7 cells were seed overnight at a density of 1 .4 
x10 5 /well in 6 well dishes. One pg of DNA was transfected using 6ml of Lipofect-Ace. Cytoplasmic 
RNA was isolated 48 hr post-transfection. RT-PCR was carried out as described by Church et ai (ibid) 
using commercially available reagents Life Technologies, Gaithersburg, MD). The resulting 
CUA-tailed PCR fragments for each restriction digested bacterial clone were pooled and UDG cloned 
into pSP72-U (a derivative of pSP72). The DNA was transformed in DHSct and the cells plated onto 
nylon membranes. After overnight growth, duplicates were made and the DNA hybridized to 32 P 
end-labeled oligos designed to detect various background products associated with the pSPL3 vector. 
One set of filters was hybridized with the following gel-purified oligos in 6X SSC aqueous hybridization 
solution at 42° C: 

vector-vector splicing 5-CGACCCAGCAACCTGGAGAT-3' 
cryptic donor-1 021 5'-AGCTCGAGCGGCCGCTGCAG-3' 
cryptic donor-1 1 34 5'-AGACCCCAACCCACAAGAAG-3' 
The filters were washed twice in 6X SSC, 10 mM sodium pyrophosphate (NaPPi) at 60°C, 30 mins. 

After overnight autoradiography, non-hybridizing clones were picked and grown in 250 
pi of LB + 1 0Opg/ml of carbenicillin in 96 well mini-rack tubes. The samples were analyzed by PCR 
using the secondary PCR primers supplied in the kit (Life Technologies, Gaithersburg, MD) and those 
clones with inserts greater than 200 bp were selected for sequencing. 

Ninety-six exon traps per bacterial clone were sequenced for a total of 768 reactions 
and the resulting data analyzed by BLAST. In addition, each potential exon was searched against a 
database of the 86 DS contigs to eliminate redundant sequences. PCR assays were developed for 
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each of the potential exons and they were tested for their presence in cDNA libraries. A total of 48 
potential exons remained after these screening steps, 
c. Sample Sequencing 

A minimal set of bacterial clones chosen to cover y899g1 were prepped with the 
5 Qiagen Maxi-Prep system and purified on CsCI. Ten micrograms of DNA from each bacterial clone 
was sonicated in a Heat Systems Sonicator XL and end-repaired with Klenow (USB) and T4 
polymerase (USB). The sheared fragments were size selected between three to four kiiobases on a 
0.7% agarose gel and then ligated to BstXI linkers (Invitrogen). The ligations were gel purified on a 
0 .7% agarose gel and cloned into a pSP72 derivative plasmid vector. The resulting plasmids were 

10 transformed into electrocompetent DH5a cells and plated on LB-carbenicillin plates. A sufficient 
number of colonies was picked to achieve 15-fold clone coverage. The appropriate number of 
colonies was calculated by the following equation to generate a single-fold sequence coverage: 
Number of colonies = size of bacterial clone (in kb)/average sequence read length (0.4 kb). These 
colonies were prepped in the 96-well AGCT system and end-sequenced with oligo MAPI using 

15 standard ABI Dye Terminator protocols. MAPI was CGTTAGAACGCGGCTACAAT. The MAPI 

sequences were screened locally with the BLAST algorithm against all available public databases. All 
sequence identities were catalogued and cross referenced to the DS and exon-trapped databases. 

A total of 3794 end sequence reactions were run to achieve the theoretical 1X 
coverage. Eighty-five percent of these sequences contained non-bacterial non-vector inserts. An 

20 additional 1060 end sequence reactions were run from the opposite end of the cloning vector to 
augment the sequence coverage and to prepare for contigging across selected regions. BLAST 
searches to all publicly available databases identified 12 histone genes and 74 unique expressed 
sequence fragments (ESF). The ESF represent a collection of ESTs and other expressed sequence 
fragments that were selected due to their sequence identity over a significant portion of genomic DNA. 

25 The ESF were cross referenced against the DS and exon-trapped databases to eliminate 

redundancies. 58 unique ESF remained, representing 39 distinct clones. Included in these ESF are 5 
sequences homologous to histone genes. 

Table 3. EST's found by Sample Sequencing Large Insert Bacterial Clones 



30 



Clone name 


Bacterial 


Homology 5* 


Homology 3' 


Poly A + 


Genomic 


cDNA 




clone 


blastx 


blastx 


signal 1 


poly (A) os 


Homology 


EST03556 


pcl57c3 


na 2 


none 3 


+ 




cDNA 28 


ym33f 1 1 


pcl57c3 


ZNF 


na 


na 


na 




EST04698 


pcl57c3 


na 


NSH 4 


+ 






EST04812 


pcl57c3 


na 


NSH 








yb89b08 


pcl57c3 


NSH 


na 


na 


na 




yd88gll 


pcl57c3 


na 


nsh 


+ 






yj49b01 


pcl57c3 


NSH 


na 


na 


na 




yv81d05 


pcl57c3 


HG17 Human 


NSH 


+ 




cDNA 30 


yg57h09 


pl96e20 


BUTYBOVIN 


NSH 


+ 




cDNA 21 


yq23d08 


p!96e20 


BUTYBOVIN 


NSH 


+ 




cDNA 21 
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Clone name 


Bacterial 


Homology 5' 


Homology 3* 


Poly A + 


Genomic 


cDNA 




clone 


blastx 


blastx 


signal 1 


polv (A) n . 


Homology 


yo65f06 


pl96e20 


NSH 


na 


na 


na 


cDNA 29 


yv88c09 


pl96e20 


BUTYBOVIN 


na 


na 


na 


cDNA 29 


ydl7d06 


pl96e20 


NSH 


na 


na 


na 


cDNA 23 


ye25g03 


pl96e20 


BUTYBOVIN 


NSH 


na 


na 


cDNA 44 


ys04h08 


pc45p21 


NSH 


NSH 


+ 




rDMA 44 


yn01c05 


pi96e20 


BUTYBOVIN 


na 


na 


na 


cDNA 39 


YG78F10 


PC45P21 


NSH 


NSH 


na 


na 




yh54fl ! 


pl96e20 


none 


NSH 








ys05b08 


pcl57c3 


NSH 


Alu 




+ 




- ybl2hll 


M32al2 


NSH 


Histone H3.1 








HSC2EE082 


bl32al2 


na 


NSH 


+ 






HUM160hllb 


bl32al2 


none 


na 


na 


na 




yg04f09 


bl32bl2 


Line element 


Alu 




+ 




yd37dll 

i 


bl32al2 


NSH 


Alu 






1 
1 


ym29g03 


bl32al2 


Histone H2A 


NSH 






cDNA 37 


yi77b02 


bl32al2 


NSH 


NSH 






cDNA 37 


yh76b05 


bl32al2 


NSH 


Alu 








yu98e02 


bl32al2 


NSH 


Alue 








yd72hl2 


bl32al2 


All! 


NSH 


+ 


+ 




ydl9d03 


pc222k22 


Histone H2B.1 


NSH 


+ 






ye98g01 


bl32al2 


NSH 


NSH 








yi61flD7 


bl32al2 


NSH 


NSH 




+ 




ESTO5340 


b3e!7 


na 


Alu 




+ 




yd35d05 


pc222k22 


NSH 


NSH 








yc52a05 


pc75L14 


NSH 


na 


na 


na 




yd84a05 


pc75L14 


none 


none 




? 5 




| yr42a05 


pc75L14 


NaPi transport 


none 


+ 




CDNA22B | 


yd83h08 


b20h20 


NSH 


none 


+ 






ye38c09 


b20h20 


NSH 


Alu 




+ 




yp74c05 


b20h20 


NaPi transport 


Alu 


? 6 


na 





No Significant Homologies 
3' splice that is not on contig 
Poor EST sequence 



Bracketed area is the critical region 

1 Signal of ATAAA or ATTAA 4 

2 Not available 5 
35 3 "NONE" reported by blast 6 



d. cDNA library screening 

Superscript plasmid cDNA libraries, brain, liver and testis, were purchased from Life 
Technologies, Gaithersburg, MD. Colonies were plated on Hybond N filters (Amersham) using 
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standard techniques. Insert probes from DS, exons and EST (I.M.A.G.E. clones; Genome Systems) 
were all isolated by PCR followed by purification in low-melting point agarose gels (Seakem). The 
DNAs were labeled in gel using the Prime-it II kit (Stratagene, La Jolla, CA). Small exon probes were 
labeled using their respective STS PCR primers instead of random primers. Up to 5 different probes 
were pooled in a hybridization. Filters were hybridized in duplicate using standard techniques. Putative 
positives were screened by PCR using the probe's STSs to identify clones. Inserts from positive clones 
were subcloned in pSP72 and sequenced. 

e. Northern blots and RT-PCR analysis 

Multiple tissue northern blots were purchased from Clontech and hybridized according 
the manufacturer's instructions. RT-PCR was carried out on random primed first strand cDNA made 
from poly A+ RNA (Clontech) using AmpliTaq Gold (Perkin-Elmer). Control reactions were performed 
on RNA samples processed in the absence of reverse transcriptase to control for genomic DNA 
contamination. 

f. Qenomic Sequencing 

The MAPI sequences from the bacterial clones b132a2, 222K22, and 75L14 were 
assembled into contigs with the Staden package (available from Roger Staden, MRC). A minimal set 
of 3 kb clones was selected for sequencing with oligo labeled MAP2 that sits on the opposite end of the 
plasmid vector. The sequence of MAP2 was GCCGATTCATTAATGCAGGT. The MAP2 sequences 
were entered into the Staden database in conjunction with the MAPI sequences to generate a tiling 
path of 3 kb clones across the region. These sequences were also screened with the BLAST algorithm 
and all novel sequence identities were noted. The plasmid 3 kb libraries were concurrently 
transformed in 96 well format into pox38UR (available from C. Martin, Lawrence Berkeley 
Laboratories). The transformants were subsequently mated with JGM (Strathman etal. P.N.A.S. 
88:1247-1250 (1991) in 96 well format. All matings of the 3 kb clones within the tiling path were 
streaked on LB-carbenicillin-kanamycin plates and a random selection of 12 colonies per 3 kb clone 
was propped in the AGCT system. The oligos -21 : CTGTAAAACGACGGCCAGTC, and REV: 
GCAGGAAACAGCTATGACC were used to sequence off both ends of the transposon. Each 3 kb 
clone was assembled in conjunction with the end sequence information from all bacterial clones to 
generate complete sequence across the region. The genomic sequence was analyzed with the 
BLAST nucleotide and protein homology algorithms and the GRAIL 1.2 software to identify novel open 
reading frames (ORF) for gene finding. 

g. Discussion 

A compilation of 1 74 ESF led to the construction of an expressed sequence map of 
the region that served as the framework for the isolation of full-length cDNAs (Figure 1). (The map 
shows the subset of ESF that were actually mapped). Probes were developed for 82 best ESFs which 
appeared to be derived from the coding portions of cDNAs and the appropriate cDNA libraries were 
screened. This led to the isolation of 19 cDNAs, 17 of which represented novel sequences. 70 of the 
174 ESF were included in the cDNAs isolated (40%). 36 probes failed to produce any clones even 
after repeated screening of several libraries. 51 ESF which were not accounted for in the cDNAs 
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cloned were not used in any screen. Therefore, it is possible that some additional genes within this 1 
megabase region may have escaped detection. 

A list of these cDNAs cloned and a comparison of the methods used to find them is 
presented in Table 4. Direct selection found 14 out of the 18 cDNAs contained within the boundaries of 
the YAC used in the experiment. Exon trapping found 15 out of the 19 cDNAs contained within the 
boundaries of the large insert bacterial clone contig. Sample sequencing identified 1 1 genes that had 
corresponding ESTs in the public database. 

Table 4. Comparison of gene finding methods 



Bacterial Clone 

157c 

157c3 

157c3 
157c3 
P18696 



CDNA # 

28 
30 

46 
20 
21 



Homology 
zinc finger 
nonhistone 

ORF 

BT 

BTF1 



EST 

EST03556 

yv81d05 

yvh07al0 

yd88gll 

none 

yn01G5 

yg23d08 

yg57h09 

yul5h03 



DS 

2 
1 

1 

none 
4 



Exon Trap 
1 

none 



45p21 

45p21 

45p21 
45p21 
3el7 

132a2 
132a2 
132a2 

75114 
132a2 
132a2 
132a2 

20h20 
20h20 



32 

29 

23 
44 
41 

43 

36 
37 

24 
39 
27 
22B 

22E 
NPT1 



BTF2 

BTF3 

BTF4 
BTF5 
genomic? 

genomic? 
genomic? 
histone 2A 

MHC class 1 
genomic? 
Ro/SSA 
NPTl-like 

NPTl-like 
NPT1 



yg78n0 
yn01c05 
ye25g03 
yo65f06 
ydl7d06 
ys04h08 
none 

none 

none 

ym29g03 

yh87a03 

ye98g01 

none 

none 

yr42a05 

yff)9g06 

none 

yp74c05 



4 
2 

none 

none 
1 

3 

1 

none 
3 
1 

2 
N/A 



6 
4 
1 

3 

none 
none 

2 
4 
4 
7 

5 
3 
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As a final approach, a tiling path with overlapping end sequences from the sample 
sequence database was generated. Each 3 kb clone within the path was shotgun-sequenced using 
transposable elements as platforms for dual end sequencing. These individual clones were assembled 
in conjunction with the end sequences from all bacterial clones in the region. The resulting sequence 
(Figure 2) was analyzed systematically with BLAST homology searches and the Grail 1 .2 program to 
identify novel open reading frames (ORF) and other gene-like structures. The BLAST homology 
searches did not produce any probes that had not already been identified by sample sequencing. Grail 
predicted exons for all the genes in the region, but was only able assemble the histones into any 
representative form. A detailed analysis of BLAST homology searches to protein databases identified 
an enticing homology to a zinc alpha 2 glycoprotein approximately 25 kb upstream of HFE, but the lack 
of a substantial ORF and the presence of a stop codon suggested that it was a pseudogene. Figure 2 
shows the positions, the exon and intron structures, and the relative orientation of transcription of novel 
genes within this region. Also shown are the positions and transcriptional orientations of the histone 
genes. A total of 12 histone genes were identified in this study. 

In an effort to account for the ESTs that did not associate with the characterized genes 
in the 250 kb region, the genomic sequence around the putative 3' ends were examined for 
polyadenylation signals to determine whether certain EST sequences may have originated from 
genomic DNA contamination in the normalized cDNA libraries used in EST generation. The positions 
of the 14 ESTs found in this region are indicated in Figure 2 to show those associated with the cDNAs 
cloned and those which did not associate with genomic DNA of obvious coding potential. Four ESTs 
corresponded to 3 of the 4 cDNAs cloned from the region (Table 2). One EST encoded a histone 
H2B.1 gene and another was a repetitive element. Of the remaining 8, 6 EST clones were used as 
probes of cDNA libraries with negative results. Those sequences representing putative 3' ends of 
cDNA were searched for the presence of poly (A)+ addition signals. Five of the 1 3 ESTs which had 3* 
end sequence, had the sequence ATAAA or ATTAA. Five of the remaining 8 ESTs that did not have a 
poly (A)+ addition signal had genomic encoded stretches of poly (A) near the end of EST sequence 
and, therefore, may have been created by oligo d(T) priming of contaminating genomic DNA. This 
analysis was expanded to include all ESTs in the large-insert bacterial contigs with definitive 3' ends. 
Of the remaining 26, 15 had 3' end sequence and, of these, 8 had poly (A)+ addition signals. Five of 
these 8 ESTs were associated with the cloned cDNAs. Of the remaining 7 which did not have poly (A)+ 
addition signals, 4 had genomic encoded stretches of poly (A), 
i. Butvrophilin gene family 

The human homolog of the bovine butyrophilin gene (BT) was cloned and mapped to 
approximately 480 kb centromeric to HFE (Figure 1). BT is a transmembrane protein of unknown 
function which constitutes 40% of the total protein associated with the fat globule of bovine milk (Jack 
et ai J. Biol. Chem. 265:14481-14486 (1990)). A human homolog of BT has recently been cloned by 
Tayloer et ai (Bipphem Piophys Acta 1 306:1-4 (1 996)). The results in this study indicated that BT is a 
member of a gene family with at least five other members of the family residing in this region (Figure 
1). A comparison of these proteins is shown in Figure 3. The proteins were aligned based on their 
descending order of relatedness and to minimized gaps in the sequence. Each of the five proteins 



WO 98/14466 



PCT/US97/17658 



31 

display varying degrees of homology to BT. BTF1 (cDNA21), BTF2 (cDNA32), BTF5 (cDNA44) t and 
BTF3 (cDNA 29) are 45% t 48%, 46%, and 49%, identical to BT, whereas BTF4 (cDNA 23), which is 
more similar to BTF3 (cDNA 29), is only 26% identical. This low degree of identity to BT is largely due 
to a truncation at the carboxyl terminus of the protein. The BTF family falls into two groups: BTF1 and 
2 which are more related to each other than to BT or the other BTF members, and BTF5, 3 and 4, 
which appear to have a common evolutionary origin. The order of these genes on the chromosome 
suggests that the BT gene has duplicated two times, giving rise to BTF1 and BTF5. Subsequently, it 
appears likely these two genes experienced further duplication events to give rise to the other 
members in their groups. 

The three major components of BT, the B-G immunoglobulin superfamily domain 
(containing the V consensus sequence) (Miller et ai Proc. NatL Acad. Sci U ft A 88:4377-4381 
(1991)), the transmembrane region, and the B30-2 exon are found in alt of these proteins (with the 
exception of BTF4 (cDNA 23) which lacks the B30-2 exon by virtue of the carboxyl terminal truncation). 
The exon B30-2 is a previously noted feature of the MHC class 1 region found approximately 200 kb 
centromeric to the HLA-A gene (Vernet et ai, J. Mol. Evol. 37:600-612 (1993)). In addition this exon is 
found in several genes of diverse function telomeric to HLA-A namely MOG (approximately 200 kb) 
and RFP (approximately 1 megabase) (Amadou etai Genomics 26:9-20 (1995)). 

The levels of the BTF mRNA were analyzed by northern blot analysis (Figure 4A). 
The expression of the BTF genes fell into two patterns. BTF1 and BTF2 were expressed as a single 
major transcript of 2.9 kb and one minor transcript of 5.0 kb. These genes were expressed at high 
levels in all the tissues tested with the exception of the kidney where the expression level was less. The 
two genes are 90% identical at the DNA sequence level, therefore, it is possible that the signal 
observed on the northerns was the result of cross-hybridization and only one of the two genes was 
actually expressed. To address this possibility RT-PCR experiments were carried out on a panel of 
different tissues in order to detect possible tissue dependent expression that would suggest that both 
genes are expressed. Identical, and thus equivocal, results were obtained with both BTF1 and BTF2 
amplification (Figure 4B). 

The second group of genes, BTF3-5, are expressed as three (BTF5) (Figure 4A) and 
two (BTF3 and 4) transcripts ranging from 4.0 to 3.3 kb. BTF5 is expressed at moderate levels in all 
tissues tested with the exception of the kidney where the expression level is less. RT-PCR 
experiments showed that mRNA from the BTF5 gene can be found in all tissues tested, including the 
kidney (Figure 4B). Identical results were obtained with primers from the other genes of this group 
(data not shown). These genes are also 90% identical to each other at the DNA sequence level (but 
only 58% identical to BTF1 and 2), hence like BTF1 and BTF2, cross-hybridization could account for 
the similarity in size and patterns on the northern blots and RT-PCR. This might be particularly true for 
BTF4 which lacks the B30-2 exon but still hybridizes to larger size transcripts like BTF5 and BTF3. 
ii. A aene with sim ilarity to 52 kP Ro/SSA auto-antigen 

Located approximately 120 kb telomeric to the HFE gene is a gene, RoRet, that has 
58% amino acid similarity to the 52 kD Ro/SSA protein, an auto-antigen of unknown function that is 
frequently recognized by antibodies in patients with systemic lupus and Sjogren's syndrome (Anderson 
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et ml. Lancet 2:456-560 (1961); Clark etal. J. Immunol 102:117-122 (1969)) (Figures 1 and 2). 
Alignment of the predicted amino acid sequence of this cDNA with that of 52 kD Ro/SSA indicated two 
features associated with the 52 kD Ro/SSA protein: a putative DNA binding cysteine rich motif 
(C-X-(l > V)-C-X(11-30)-C-X-H-X-(F.I.L)-C-X(2)-C-(i;L.M)-X(10-18)-C-P-X-C) found at the N terminus 
(Freemont et at. CeM 64: 483-484 (1991)) and the B30-2 exon found near the carboxyl terminus, are 
both conserved in RoRet (Figure 5). Northern blot analysis indicated the RoRet gene was expressed 
as two major transcripts of 2.8 and 2.2 kb and two minor transcripts of 7.1 and 4.4 kb in all of the 
tissues on the blot at levels reflective of the RNA amounts as determined by P-actin probing (Figure 
6A). Using RT-PCR, expression can also be detected in small intestine, kidney liver, and spleen 
(Figure 6B). 

iii. Two genes with homo logy to a sodium phosphate trans porter 

A cDNA for a sodium phosphate transport protein (NPT1) was previously cloned and 
mapped to 6p21.3 using a somatic cell hybrid panel (Chong etal. Genomics 18:355-359 (1993)). 
NPT1 maps 320 kb telomeric to the HFE gene (Figures 1 and 2). Two additional cDNAs were cloned 
which show appreciable homology to NPT1 (Figure 5). These genes, NPT3 and NPT4, mapped 1.5 
megabases and 1 .3 megabases centromeric to the NPT1 gene (Figure 1). Like NPT1 , the gene 
products of NPT3 and NPT4 were extremely hydrophobic, which may reflect a membrane location. 
Both proteins gave hydrophilicity profiles which were indistinguishable from NPT1 in this study (data not 
shown). Northern blot analysis indicated that the two genes have different patterns of expression 
(Figure 6C). NPT3 was expressed at high levels as a 7.2 kb transcript predominately in muscle and 
heart. Lesser amount of the mRNA were also found in brain, placenta, lung, liver and pancreas. 
RT-PCR analysis indicated that expression of the proper size PCR fragment for NPT3 was clearly 
absent in fetal brain, bone marrow and small intestine (Figure 6D). A smaller size fragment was 
detectable in all tissues with the exception of the liver, which may represent evidence for alternative 
splicing. Although expression was apparently absent from the kidney by northern blot analysis, it was 
detectable by RT-PCR. Expression was also noted in the mammary gland, spleen and testis. NPT4, 
on the other hand, was expressed only in the liver and the kidney as a smear of transcripts 
approximately 2.6 - 1.7 kb (Figure 6C). RT-PCR confirmed these results, although a small amount of 
the proper size PCR fragment was also found in the small intestine and testis (Figure 6D). Other 
tissues showed amplification, but the fragments were of larger and smaller size than that produced by 
the cDNA 22E positive control. Hence, these two genes which apparently have the structural 
characteristics of a sodium phosphate transporter, appeared to be under the control of different 
regulatory mechanism that lead to differential patterns of expression. 

2 - Sequencing of 235 kb from a Homozygous Ancp ^ tral (Affected) Individual 

In these studies the entire genomic sequence was determined from an HH affected 
individual for a region corresponding to a 235,033 bp region surrounding the HFE gene between the 
flanking markers D6S2238 and D6S2241 . The sequence was derived from a human lymphoblastoid 
cell line, HC14, that is homozygous for the ancestral HH mutation and region. The sequence from the 
ancestral chromosome (Figure 9) was compared to the sequence of the region in an unaffected 
individual (Figure 8) disclosed in copending U.S.S.N. 08/724,394 to identify polymorphic sites. A 



WO 98/14466 



PCT/US97/176S8 



33 

subset of the polymorphic alleles so defined were further studied to determine their frequency in a 
collection of random individuals. 

The cell line HC14 was deposited with the ATCC orh June 25, 1997, and is designated 
ATCC CRL-12371. 

a. Cosmid Library Screening 

The strategy and methodology for sequencing the genomic DNA for the affected 
individual was essentially as described in copending U.S.S.N. 08/724,394, hereby incorporated by 
reference in its entirety. Basically, a cosmid library was constructed using high molecular weight DNA 
from HC14 cells. The library was constructed in the supercos vector (Stratagene, La Jolla, CA). 
Colonies were replicated onto Biotrans nylon filters (ICN) using standard techniques. Probes from 
genomic subclones used in the generation of the sequence of the unaffected sequence disclosed in 
08/724,394 were isolated by gel electrophoresis and electroporation. Subclones were chosen at a 
spacing of approximately 20 kb throughout the 235 kb region. The DNA was labeled by incorporation 
of 32P dCTP by the random primer labeling approach. Positively hybridizing clones were isolated to 
purity by a secondary screening step. Cosmid insert ends were sequenced to determine whether full 
coverage had been obtained, and which clones formed a minimal path of cosmids through the 235 kb 
region. 

b. Sample Sequencing 

A minimal set of cosmid clones chosen to cover the 235 kb region were prepped with 
the Qiagen Maxi-Prep system. Ten micrograms of DNA from each cosmid preparation were sonicated 
in a Heat Systems SonicatorXL and end-repaired with Klenow (USB) and T4 DNA polymerase (USB). 
The sheared fragments were size selected between three to four kilobases on a 0.7% agarose gel and 
then ligated to BstXI linkers (Invitrogen). The ligations were gel purified on a 0.7% agarose gel and 
cloned into a pSP72 derivative plasmid vector. The resulting plasmids were transformed into 
electrocompetent DH5ct cells and plated on LB-carbenicillin plates. A sufficient number of colonies 
was picked to achieve 1 5-fold clone coverage. The appropriate number of colonies was calculated by 
the following equation to generate a single-fold sequence coverage: Number of colonies = size of 
bacterial clone (in kb)/average sequence read length (0.4 kb). These colonies were prepped in the 
96-well Qiagen REAL, and the 5' to 3' DNA Prep Kit, and AGCT end-sequenced with oligo MAPI using 
standard ABI Dye Terminator protocols. MAPI was CGTTAGAACGCGGCTACAAT. 

c. Genomic Sequencing 

The MAPI sequences from the cosmid clones HC182, HC187, HC189, HC195, 
HC199, HC200, HC201 . HC206, HC207. and HC212 were assembled into contigs with the Staden 
package (available from Roger Staden, MRC). A minimal set of 3 kb clones was selected for 
sequencing with oligo labeled MAP2 that sits on the opposite end of the plasmid vector. The sequence 
of MAP2 was GCCGATTCATTAATGCAGGT. The MAP2 sequences were entered into the Staden 
database in conjunction with the MAPI sequences to generate a tiling path of 3 kb clones across the 
region. The plasmid 3 kb libraries were concurrently transformed in 96 well format into pox38UR 
(available from C. Martin, Lawrence Berkeley Laboratories). The transformants were subsequently 
mated with JGM (Strathman et al. P.N.A.S. 88:1247-1250 (1991) in 96 well format. All matings of the 
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3 kb clones within the tiling path were streaked on LB-carbenicillin-kanamycin plates and a random 
selection of 12 colonies per 3 kb clone was prepped in the AGCT system. The oligos -21 : 
CTGTAAAACGACGGCCAGTC, and REV: G C AG G AAAC AGCTATG ACC were used to sequence off 
both ends of the transposon. Each 3 kb clone was assembled in conjunction with the end sequence 
information from all cosmid clones in the region. 

In some regions, the coverage of the genomic sequence by cosmids was incomplete. 
Any gaps in the sequence were filled by using standard PCR techniques to amplify genomic DNA in 
those regions and standard ABI dye terminator chemistry to sequence the amplification products. 

d. Identification of Polymorphic Sites . 

The assembled sequence of the cosmid clones in connection with the PCR amplified 
genomic DNA was compared to the genomic sequence of the unaffected individual using the FASTA 
algorithm. Numeric values were assigned to the sequenced regions of 1 to 235,303, wherein base 1 
refers to the first C in the CA repeat of D6S2238 and base 235,303 is the last T in the GT repeat of 
D6S2241 of the unaffected sequence (Figure 8). Table 1 lists the differences between the two 
compared sequences. Note that previously disclosed (Feder et al., Nature Qgnejiss 13:399-408 
(1996)) polymorphic sites D6S2238 (base 1). D6S2241 (base 235,032), 24d1 (base 41316), and 
D6S2239 (base 84841) are not included in the list of new polymorphisms, although they are provided 
for reference in a footnote to the Table and were observed in the ancestral sequence. In the Table, a 
single base change such as C-T refers to a C in the unaffected sequence at the indicated base position 
that occurred as a T in the corresponding position in the affected sequence. Similarly, an insertion of 
one or more bases, such as TTT in the affected sequence, is represented as "TTT INS" between the 
indicated bases of the unaffected sequence. A deletion of one or more bases occurring in the affected 
sequence, such as AAA DEL, is represented as the deletion of the indicated bases in the unaffected 
sequence. 

e. Characteri zation of Rare Polymorphism* 

In this study about 100 of the polymorphisms of Table 1 were arbitrarily chosen for 
further characterization. Allele frequencies in the general population were estimated by OLA analysis 
using a population of random DNAs (the "CEPH" collection. J. Dausset et al., Genomics 6(3):575-577 
(1990)). These results are provided in Table 2. 

One single base pair difference, occurring at base 35983 and designated 
C182.1G7T/C (an A to G change on the opposite strand) was present in the ancestral chromosome 
and rare in the random DNAs. This change occurred in a noncoding region of the hemochromatosis 
gene near exon 7 approximately 5.3 kb from the 24d1 (Cys282Tyr) mutation. OLA was used to 
genotype 90 hemochromatosis patients for the C182.1G7T/C base pair change. The frequency for C 
occurring at this position in the patients was 79.4% as compared to 5% in the random DNAs. 
Eighty-five of the 90 patients assayed contained identical 24d1 and C182.1G7T/C genotypes. Four of 
the remaining 5 patients were homozygous at 24d1 and heterozygous at C1 82.1 G7T/C; one was 
heterozygous at 24d1 and homozygous at C182.1G7T/C. The primers used for this analysis were as 
follows. 



WO 98/14466 



PCT/US97/17658 



PCR primers for detection: 

182.1G7.F 5'-GCATCAGCGATTAACTTCTAC -3* 
1 82.1 G7.R S'-TTGCATTGTGGTGAAATCAGGG -3' 

For the detection assay, the biotinylated primers used were as follows. 
1 82.1 G7.C 5' (b)CTGAGTAATTGTTTAAGGTGC -3' 
182.1G7.T 5* (b)CTGAGTAATTGTTTAAGGTGT -3' 

The phosphorylated digoxigenin-labeled primer used was: 

1 82.1 G7.D 5' (p)AGAAGAGATAGATATGGTGG -3' 



A further rare single base pair change was detected at 61 ,465bp. The inheritance 
pattern of this polymorphism, C195.1 H5C/T (a G to A change on the opposite strand), is identical to 
that of 24d1 . The frequency of T occurring at that position (C1 95,1 H5T) observed in a set of 76 
patients was 78.5% as compared to 5% in random individuals. 



PCR primers for detection: 

1 951 H5.3F 5 , -GAATGTGACCGTCCCATGAG-3 t 

1 951 H5.3R 5 -CAACTGAATATGCAGAAAAAAGTACACC-3' 

For the detection assay, the biotinylated primers used were: 

1 951 H5.3.4 5' (b)AGTAGCTGGGACTCACGGTGT-3' 
1957H5.3.5 5' (b)AGTAGCTGGGACTCACGGTGC-3* 

The phosphorylated digoxigenin-labeled primer used was: 

1951H5.3.6 5' (p)GCGCCACCACTCCCAGCTCAT-3' 



These rare alleles are thus preferred surrogate markers for 24d1 and are especially 
useful in screening assays for the likely presence of 24d1 and/or 24d2. 

All publications, patents, and patent applications cited herein are hereby incorporated 
by reference in their entirety. 
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WHAT IS CLAIMED IS - 

1 ■ An oligonucleotide comprising at least 8 to about 1 00 consecutive bases from the 
sequence of Figure 9, or the complement of the sequence, wherein the at least 8 to about 1 00 
consecutive bases includes at least one polymorphic site of Table 1 . 

2. The oligonucleotide of claim 1 , wherein the polymorphic site is selected from the 
group consisting of base 35983 or base 61465. 

3. An oligonucleotide pair selected from the sequence of Figure 9 or its complement for 
amplification of a polymorphic site of Table 1 . 

4. An isolated nucleic acid molecule comprising about 100 consecutive bases to about 
235 kb substantially identical to the sequence of Figure 9, wherein the DNA molecule comprises at 
least one polymorphic site of Table 1. 

5. The isolated nucleic acid molecule of claim 4, wherein the polymorphic site is selected 
from the group consisting of base 35983 or base 61465. 

6. The isolated nucleic acid molecule of claim 4, wherein the nucleic acid is selected 
from the group consisting of cDNA, RNA, or genomic DNA. 

7. A polypeptide encoded by the nucleic acid molecule of claim 4. 

8. An antibody which specifically recognizes the polypeptide of claim 7. 

9. A method to determine the presence or absence of the common hereditary 
hemochromatosis (HFE) gene mutation in an individual comprising: 

providing DNA or RNA from the individual; and 

assessing the DNA or RNA for the presence or absence of a haplotype of Table 1 , 
wherein, as a result, the absence of a haplotype of Table 1 indicates the likely absence of the 
HFE gene mutation in the genome of the individual and the presence of the haplotype indicates the 
likely presence of the HFE gene mutation in the genome of the individual. 

10. The method of claim 9, wherein the method further comprises assessing the RNA or 
DNA for the presence of at least one of the polymorphisms 24d1 , 24d2, HHP-1 , HHP-1 9. or HHP-29; 
or microsatellite repeat alleles 1909:205, 18B4:235, 1A2:239, 1E4:271, 24E2:245, 2B8:206, 3321- 
1:98,4073-1:182,4440-1:180.4440-2:139. 731-1:177. 5091-1:148. 3216-1:221. 4072-2:170. 950- 
1:142. 950-2:164. 950-3:165. 950-4:128. 950-6:151. 950-8:137. 63-1:151. 63-2:113. 63-3:169 65- 
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6 1:206, 65-2:159, 68-1:167, 241-5:108, 241-29:113, 373-8:151, 373-29:113, D6S258:199, D6S265:122, 

7 D6S1 05:124, D6S306:238, D6S464:206, or D6S1001:180. 

1 11. The method of claim 9, wherein the haplotype comprises at least two polymorphic 

2 sites of Table 1. 

1 1 2. The method of claim 1 1 , wherein one of the at least two polymorphic sites of Table 1 

2 is at base 35983 or 61 465. 

1 1 3. The method of claim 1 1 , wherein the haplotype comprises at least three polymorphic 

2 sites of Table 1. 

1 1 4. A method to determine the presence or absence of the common hereditary 

2 hemochromatosis (HFE) gene mutation in an individual comprising: 

3 providing DIMA or RNA from the individual; and 

4 assessing the DNA or RNA for the presence or absence of a genotype defined by a 

5 polymorphic allele of Table 1 , 

6 wherein, as a result, the absence of a genotype defined by a polymorphic allele of Table 1 

7 indicates the likely absence of the HFE gene mutation in the genome of the individual and the 

8 presence of the genotype indicates the likely presence of the HFE gene mutation in the genome of the 

9 individual. 

1 1 5. The method of claim 1 5, wherein the polymorphic allele occurs in less than about 50% 

2 of a random population of individuals. 

1 1 6. The method of claim 1 5, wherein the polymorphic allele occurs in less than about 25% 

2 of a random population of individuals. 

1 1 7. The method of claim 1 5, wherein the polymorphic allele occurs in less than about 5% 

2 of a random population of individuals. 

1 1 8. The method of claim 1 5, wherein the genotype is C1 82.1 G7C or C1 95.1 H5T. 

1 19. A kit comprising one or more oligonucleotides of claim 1 . 

1 20. A kit comprising at least one oligonucleotide pair of claim 3. 

1 21 . A culture of lymphoblastoid cells having the designation ATCC CRL-12371 
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22. An isolated nucleic acid sequence comprising a sequence substantially identical to 



BTF1. 



BTF2. 



BTF3. 



23. The isolated nucleic acid sequence of claim 23, wherein the nucleic acid is cDNA. 

24. The polypeptide encoded by the isolated nucleic acid sequence of claim 23. 

25. A vector comprising the nucleic acid sequence of claim 23. 

26. A host cell stably transfected with the nucleic acid sequence of claim 23. 

27. An antibody that is specifically immunoreactive with the polypeptide of claim 24. 

28. An isolated nucleic acid sequence comprising a sequence substantially identical to 

29. The isolated nucleic acid sequence of claim 28, wherein the nucleic acid is cDNA. 

30. The polypeptide encoded by the isolated nucleic acid sequence of claim 28. 

31 . A vector comprising the nucleic acid sequence of claim 28. 

32. A host ceil stably transfected with the nucleic acid sequence of claim 28. 

33. An antibody that is specifically immunoreactive with the polypeptide of claim 30. 

34. An isolated nucleic acid sequence comprising a sequence substantially identical to 

35. The isolated nucleic acid sequence of claim 34, wherein the nucleic acid is cDNA. 

36. The polypeptide encoded by the isolated nucleic acid sequence of claim 34. 

37. A vector comprising the nucleic acid sequence of claim 34. 

38. A host cell stably transfected with the nucleic acid sequence of claim 34. 

39. An antibody that is specifically immunoreactive with the polypeptide of claim 36. 
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40. An isolated nucleic acid sequence comprising a sequence substantially identical to 



BTF4. 



BTF5. 



NTP-3. 



41 . The isolated nucleic acid sequence of claim 40, wherein the nucleic acid is cDNA. 

42. The polypeptide encoded by the isolated nucleic acid sequence of claim 40. 

43. A vector comprising the nucleic acid sequence of claim 40. 

44. A host cell stably transfected with the nucleic acid sequence of claim 40. 

45. An antibody that is specifically immunoreactive with the polypeptide of claim 42. 

46. An isolated nucleic acid sequence comprising a sequence substantially identical to 

47. The isolated nucleic acid sequence of claim 46, wherein the nucleic acid is cDNA. 

48. The polypeptide encoded by the isolated nucleic acid sequence of claim 46. 

49. A vector comprising the nucleic acid sequence of claim 46. 

50. A host cell stably transfected with the nucleic acid sequence of claim 46. 

51 . An antibody that is specifically immunoreactive with the polypeptide of claim 48. 

52. An isolated nucleic acid sequence comprising a sequence substantially identical to 

53. The isolated nucleic acid sequence of claim 52, wherein the nucleic acid is cDNA. 

54. The polypeptide encoded by the isolated nucleic acid sequence of claim 52. 

55. A vector comprising the nucleic acid sequence of claim 52. 

56. A host cell stably transfected with the nucleic acid sequence of claim 52. 

57. An antibody that is specifically immunoreactive with the polypeptide of claim 54. 



RoRet. 
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58. An isolated nucleic acid sequence comprising a sequence substantially identical to 

NTP-4. 

59. The isolated nucleic acid sequence of claim 58, wherein the nucleic acid is cDNA. 

60. The polypeptide encoded by the isolated nucleic acid sequence of claim 58. 

61 . A vector comprising the nucleic acid sequence of claim 58. 

62. A host cell stably transfected with the nucleic acid sequence of claim 58. 

63. An antibody that is specifically immunoreactive with the polypeptide of claim 60. 

64. An isolated nucleic acid sequence comprising a sequence substantially identical to 

65. The isolated nucleic acid sequence of claim 64, wherein the nucleic acid is cDNA. 

66. The polypeptide encoded by the isolated nucleic acid sequence of claim 64. 

67. A vector comprising the nucleic acid sequence of claim 64. 

68. A host cell stably transfected with the nucleic acid sequence of claim 64. 

69. An antibody that is specifically immunoreactive with the polypeptide of claim 66. 

70. An isolated nucleic acid sequence comprising at least 1 8 contiguous nucleotides 
substantially identical to 18 contiguous nucleotides of BTF1 . 

71 . An isolated nucleic acid sequence comprising at least 1 8 contiguous nucleotides 
substantially identical to 18 contiguous nucleotides of BTF2. 

72. An isolated nucleic acid sequence comprising at least 1 8 contiguous nucleotides 
substantially identical to 18 contiguous nucleotides of BTF3. 

73. An isolated nucleic acid sequence comprising at least 1 8 contiguous nucleotides 
substantially identical to 18 contiguous nucleotides of BTF4. 

74. An isolated nucleic acid sequence comprising at least 1 8 contiguous nucleotides 
substantially identical to 18 contiguous nucleotides of BTF5. 
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75. An isolated nucleic acid sequence comprising at least 1 8 contiguous nucleotides 
substantially identical to 18 contiguous nucleotides of NPT3. 

76. An isolated nucleic acid sequence comprising at least 18 contiguous nucleotides 
substantially identical to 18 contiguous nucleotides of NPT4. 

77. An isolated nucleic acid sequence comprising at least 1 8 contiguous nucleotides 
substantially identical to 18 contiguous nucleotides of RoRet. 
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*Zl± ME SAAALH FS R PAS LLLLLLSLCALVSAQFI WGPTDPI LATVGENTTLRCHLS PE 

"til MEPAAALHFSLPASLLLLLLLLLLSLC^LVSAQFTVVGPANPILAMVGENTTLRCHLSPE 

BTF5 MKMASFLAFLLLNFR— VCLLLLQLLMPHSAQFSVLGPSGPILAMVGEDADLPCHLFPT 

SIS MKMASSLAFLLLNFH VSLFLVQLLTPCSAQFSVLGPSGPILAMVGEDADLPCHLFPT 

BTF4 MKMASSLAFLLLNFH VSLLLVQLLTPCSAQFSVLGPSGPI LAMVGEDADLPCHLFPT 
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BT ASAEHLE LRWFRKKVS PAVLVHRDGREQEAEQMPE YRGRATLVQDGI AKGR VALR I RGVR 

BTF1 KNAEDMEVRWFRSQFSPAVFVYKGGRERTEEQMEEYRGRTTFVSKDISRGSVALVIHNIT 

BTF5 
BTF3 
BTF4 



KNAEDMEVRWFRSQFSPAVFVYKGGRERTEEQMEEYRGRITFVSKDINRGSVALVIHNVT 
MSAETMELKWVS SSLRQWNVYADGKEVEDRQSAPYRGRTS I LRDGI TAGKAALRI HNVT 

MSAETMELRWVSSSLRQVVNVYADGKEVEDRQSAPYRGRTSILRDGITAGKAALRIHNVT 
MSAETME LKWVS SSLRQWNVYADGKEVEDRQSAPYRGRTS I LRDGI TAGKAALRI HNVT 
** •*••* * * *.* * **** . . ***** 



• VSDDGE YTCFFREDGS YEEALVHLKVAALGSDPHI SMQVQENGE I CLECTS VGWY PE POV 
BTF1 AQENGTYRCYFQEGRSYDEAI LHLWAGLGSKPLISMRGHEDGGIRLECISRGWYPKPLT 

AQENGIYRCYFQEGRSYDEAILRLWAGLGSKPLIEIKAQEDGSIWLECISGGWYPEPLT 
ASDSGKYLCYFQDGDFYEKALVELKVAALGSDLHVDVKGYKDGGIHLECRSTGWYP0POI 
ASDSGKYLCYFQDGDFYEKALVELKVAALGSDLHIEVKGYEDGGIHLECRSTGWYPQPOI 
ASDSGKYLCYFQDGDFYEKALVELKVAALGSNLHVEVKGYEDGGIHLECRSTGWYPQPQI 



*. *.. * ** *** _ * * t ^ 



QWRTSKGEKFPSTSESRNPDEEGLFTVAASVIIRDTSTKNVSCYIQNLLLGQEKKVEISI 
VWRDPYGGVAPALKEVSMPDADGLFMVTTAVIIRDKSVRNMSCSINNTLLGQKKESVIFI 
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KWSDTKGENIPAVEAPWADGVGLYAVAASVIMRGSSGGGVSCI IRNSLLGLEKTASISI 
QWSNAKGENI PAVEAPWADGVGLYEVAASVIMRGGSGEGVSCI I RNS LLGLE KTAS IS I 



EL. PASSLPRLTPWIVAVAV ILMVLGLLTIGSI FFTWRLYNER 

PESFMPSVSPCAVALP IIWILMIPIAVCIYWINKLQKEKKILSGEK 

PES FMPSAS PWMVALAVI LTAS PWMVSMTVI LAV FI I FMAVS I CC I KKLQREKKI LS GE K 

ADPFFRSAQRWIAALAR TLPVLLLLLGGAGYFLWQQQEEKKTQFRKK 

ADPFFRSAQPWIAALAG TLPISLLLLAGASYFLWRQQKEKIALSRET 

ADPFFRSAQPWIAALAG TLPILLLLLAGASYFLWRQQKEITALSSEI 
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°TF1 E FERETRE IALKE LEKERVQKEEELQVKEKLQEE LRWRRT FLHA 
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till EREREMKEMGYAATEQEIS LREKLQEELKWRKIQYMARGEKSLAYHEWKMALF 

ESEQEMKEMGYAATEREIS LRESLQEELKRKKSST 



★ 
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— VDWLDPDTAHPDLFLSEDRRSVRRCPFRHLGESVPDNPERFDSQPCVLGRESFASGK 
— ADWLDPDTAHPELFLSEDRRSVRRGPYRQR— VPDNPERFDSQPCVLGWESFASGK 
KPADVILDPKTANPILLVSEDQRSVQRAKE PQD LPDNPERFNWHYCVLGCESFISGR 

KPADVILDPDTANAILLVSEDQRSVQRAEEPRD— LPDNPERFEWRYCVLGCENFTSGR 
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HYWEVEVGDRKEWHIGVCSKNVQRK-GWVKMTPENGFWTMGLTDGNKYRTLTEPRTNLKL 
HYWEVEVGDRKEWHIGVCSKNVERKKGWVKMTPENGYWTMGLTDGNKYRALTEPRTNLKL 
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AGPPRRVGIFLDYESGDISFYNMNDGSDIYTFSWTFSGPLRPFFCLWSSGKKPLTICPI 
KESLCRVGVFLDYEAGDVSFYNMRDRSHIYTCPRSAFSVPVRPFFRLGC— EDSPIFICPA 
KESLCRVGVFLDYEAGDVSFYNMRDRSHIYTCPRSAFTVPVRPFFRLGS-DDSPIFICPA 
PKPPKKVGVFLDYETGDISFYNAVDGSHIHTFLDV.SFSEALYPVFRILTLEPTALSICPA 
PEPPRKVGIFLDYETGEISFYNATDGSHI YTFPHASFSEPLYPX'FRILTLE PTALTICPI 
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cgacccacgcgtccgaacatggcgacctaggagaaagggaagaacaattttttctcctcttttgggaagg 

tttgcgtctagtagtgcctgtgcccctgggcagattggagagaagagggacgactggagaatcgtcgaga 

accagcggagaaaagaaaaagcaacgtttaattctagaaggcctcctgtccctgcctgctctgggtgctc 

atggaatcagctgctgccctgcacttctcccggccagcctccctcctcctcctcctcctcagcctgtgta 

cactggtctcagcccagtttattgtcgtggggcccactgatcccatcttggccacggttggagaaaacac 

tacgttacgctgccatctgtcacccgagaaaaatgctgaggacatggaggtgcggtggttccggtctcaq 

ttctcccccgcagtgtttgtgtataaaggtggcagagagagaacagaggagcagatggaggagtaccgaq 

gaagaaccacctttgtgagcaaagacatcagcaggggcagcgtggccctggtcatacacaacatcacagc 

ccaggaaaacggcacctaccgctgttacttccaagaaggcaggtcctacgatgaggccatcctgcacctc 

gtagtggcaggactaggctctaagcccctcatttcaatgaggggccatgaagacgggggcatccggctgq 

agtgcatatctagagggtggtacccaaagcccctcacagtgtggagggacccctacggtggggttgcqcc 

tgccctgaaagaggtctccatgcctgatgcagacggcctcttcatggtcaccacggctgtgatcatcaqa 

gacaagtctgtgaggaacatgtcctgctctatcaacaacaccctgctcggccagaagaaagaaagtgtca 

tttttattccagaatcctttatgcccagtgtgtctccctgtgcagtggccctgcctatcattgtqgttat 

tctgatgatacccattgccgtatgcatctattggatcaacaaactccaaaaggaaaaaaagattctgtca 

ggggaaaaggagtttgaacgggaaacaagagaaattgctctaaaggaactggagaaagaacgtgtgcaaa 

aagaggaagaacttcaagtaaaagagaaacttcaagaagaattgcgatggagaagaacattcttacatgc 

tgttgatgtggtcctggatccagacaccgctcatcccgatctcttcctgtcagaggaccggagaagtgtg 

agaaggtgccccttcaggcacctaggggagagcgtgcctgacaacccagagagattcgacagtcagcctt 

gtgtcctaggccgggagagcttcgcttcagggaaacattactgggaggtggaggtggaaaacgtgattga 

gtggactgtgggggtctgtagagacagtgttgagaggaaaggggaggtcctgctgattcctcagaatggc 

ttctggaccttggagatgcataaagggcaataccgggccgtgtcctcccctgataggattctccctttga 

aggagtccctttgccgggtgggcgtcttcctggactatgaagctggagatgtctccttctacaacatgag 

ggacagatcgcacatctacacatgtccccgttcagccttttccgtgcctgtgaggcccttcttcaggttg 

gggtgtgaggacagccccatcttcatctgccctgcactcacaggagccaatggggtcacggtgcctgaag 

agggcctgacacttcacagagtggggacccaccagagcctatagaatcaattccttggtctcacagccat 

gtagacaagccctggtcatctcagcagccaccgcacaacacccctggtggaagacacgccctcctcccct 

ctggtcacacaagagaacatcttccagctgcctctttcacacccactacagacctcagccccagttttct 

cctcctcactaggctgtgtttttagtagttcctttgcttgtaactatgggatgggatccaggcataggga 

actagttgttacacagctcccagccaagaagaaagtgtgagaagttgatgggcagcaaacctgctgttta 

acatcagggtgaccacattaagcccagtattccagttggcaccagaagatatggacttggaatgaggcct 

acagggttcaccaggatgtaagaggagagaggaatccacaggaccaccagagaggagagggaaccagata 

tgcagatcagagatagaggaagtggaaccagagagctgggagggaccaaggttgtaagggtggctaagtc 

ccaccataacagctaaggggacctgggagatgatggctcatttccacccagccccaggatttccagagcg 

cacatccacaggcctggacctgggatgaagatgaatgaagaacatggatgcacgtggatgtagtttggct 

caggtgtccctgcagttggcaaggagtcagtactcagtccctgagtgtggctgaaatttgaggtcctggc 

tgagccaaggagtaatggaccagatctacctcagtattcaagttcagtggggacaccagtggcttcaaac 

ttcctggtttcatgatatcttgagacgccttacaaatgatggaggattccaaagagtttttgtttatttq 

ggttaatatttgttggtatttatggcatttgagattgaaactaagaaatgttttaatttattacctttac 

aacatttatttacattacatacatacatttacaacatttattaatttatattaaaatagcatgaataagc 

caattataggttaatataagtagaatgtttgtgaaaaataagtatggtatccaaagcaaaataaatttta 
ttgtgaagtgtgaaaaaaaaaaaaaaaaaaaaaa 
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acgcgtccgcttcggaatgagagactcaaccataatagaaagaatggagaactattaaccaccattcttc 
agtgggctgtgattttcagaggggaatactaagaaatggttttccatactggaacccaaaggtaaagaca 
ctcaaggacagacatttttggcagagcatagatgaaaatggcaagttccctggctttccttctgctcaac 
tttcatgtctccctcttcttggtccagctgctcactccttgctcagctcagttttctgtgcttggaccct 
ctgggcccatcctggccatggtgggtgaagacgctgatctgccctgtcacctgttcccgaccatgagtgc 
agagaccatggagctgaggtgggtgagttccagcctaaggcaggtggtgaacgtgtatgcagatggaaaq 
gaagtggaagacaggcagagtgcaccatatcgagggagaacttcgattctgcgggatggcatcactgcaq 
ggaaggctgctctccgaatacacaacgtcacagcctctgacagtggaaagtacttgtgttatttccaaga 
tggtgacttctacgaaaaagccctggtggagctgaaggttgcagcattgggttctgatcttcacattgaa 
gtgaagggttatgaggatggagggatccatctggagtgcaggtccactggctggtacccccaaccccaaa 
;^^ t ? gagC ? acaccaagggagagaacatccc gg ct g t gg aa g ca cctgtggttgcagatggagtggg 
cctgtatgcagtagcagcatctgtgatcatgagaggcagctctggtgggggtgtatcctgcatcatcaga 
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aattccctcctcggcctggaaaagacagccagcatatccatcgcagaccccttcttcaggagcgcccagc 
cctggatcgcggccctggcagggaccctgcctatctcgttgctgcttctcgcaggagccagttacttctt 
gtggagacaacagaaggaaaaaattgctctgtccagggagacagaaagagagcgagagatgaaagaaatg 
ggatacgctgcaacagagcaagaaataagcctaagagagaagctccaggaggaactcaagtggaggaaaa 
tccagtacatggctcgtggagagaagtctttggcctatcatgaatggaaaatggccctcttcaaacctgc 
ggatgtgattctggatccagacacggcaaacgccatcctccttgtttctgaggaccagaggagtgtgcag 
cgtgctgaagagccgcgggatctgccagacaaccctgagagatttgaatggcgttactgtgtccttggct 
gtgaaaacttcacatcagggagacattactgggaggtggaagtgggggacagaaaagagtggcatattgg 
ggtatgtagtaagaacgtggagaggaaaaaaggttgggtcaaaatgacaccggagaacggatactggact 



atctcatatctacacctttccgcacgcctctttctctgagcctctatatcctgttttcagaattttgacc 
ttggagcccactgccctgaccatttgcccaataccaaaagaagtagagagttcccccgatcctgacctag 
tgcctgatcattccctggagacaccactgaccccgggcttagctaatgaaagtggggagcctcaggctga 
agtaacatctctgcttctccctgcccaccctggagctgaggtctccccttctgcaacaaccaatcagaac 
cataagctacaggcacgcactgaagcactttactgatattcattccattattccatatgacagttgtttt 
gagtttcgtaccaccttattgtccccttatacagataaggaaactggggtgcagaaaggtgaattaactt 
tacaaagtagacatgacaagtgaacagcagagctgggatctaaacagcaataactaacattaacagagaa 
tttaaaatgttcttagtgctgtgttataagctttggtggatgtcactcctttaatcctcacaacaccctg 
tcgggtagtcatattttgcaagtatggaagctgaggcagggcaacatgaagtaacttacataattcatac 
agtaatttgtgcagttgggagatgttcagccttagtccctggctaattgcctgttcttttccagcctgat 
tttttttcccacaggaagagcccacatgtagccctgaggtttccttcccaggacagctgcagggtagaga 
tcattttaagtgcttgtggagttgacatccctattgactctttcccagctgatatcagagacttagaccc 
agcactccttggattagctctgcagagtgtcttggttgagagaataacctcatagtaccaacatgacatg 
tgacttggaaagagactagaggccacacttgataaatcatggggcacagatatgttcccacccaacaaat 
gtgataagtgattgtgcagccagagccagccttccttcaatcaaggtttccaggcagagcaaatacccta 
gagattctctgtgatataggaaatttggatcaaggaagctaaaagaattacagggatgtttttaatccca 
ctatggactcagtctcctggaaataggtctgtccactcctggtcattggtggatgttaaacccatattcc 
tttcaactgctgcctgctagggaaaactgctcctcattatcatcactattattgctcaccactgtatccc 
ctctacttggcaagtggttgtcaagttctagttgttcaataaatgtgttaataatgaaaaaaaaaaa 



>CDNA23 

atttgctttctctttttcctttcttccggatgagaggctaagccataatagaaagaatggagaattattg 
attgaccgtctttattctgtgggctctgattctccaatgggaataccaagggatggttttccatactgga 
acccaaaggtaaagacactcaaggacagacatttttggcagagcatagatgaaaatggcaagttccctgg 
ctttccttctgctcaactttcatgtctccctcctcttggtccagctgctcactccttgctcagctcagtt 
ttctgtgcttggaccctctgggcccatcctggccatggtgggtgaagacgctgatctgccctgtcacctg 
ttcccgaccatgagtgcagagaccatggagctgaagtgggtaagttccagcctaaggcaggtggtgaacg 
tgtatgcagatggaaaggaagtggaagacaggcagagtgcaccgtatcgagggagaacttcgattctgcg 
ggatggcatcactgcagggaaggctgctctccgaatacacaacgtcacagcctctgacagtggaaagtac 
ttgtgttatttccaagatggtgacttctatgaaaaagccctggtggagctgaaggttgcagcactgggtt 
ctaatcttcacgtcgaagtgaagggttatgaggatggagggatccatctggagtgcaggtccaccggctg 
gtacccccaaccccaaatacagtggagcaacgccaagggagagaacatcccagctgtggaagcacctgtg 
gttgcagatggagtgggcctatatgaagtagcagcatctgtgatcatgagaggcggctccggggagggtg 
tatcctgcatcatcagaaattccctcctcggcctggaaaagacagccagcatttccatcgcagacccctt 
cttcaggagcgcccagccctggatcgcagccctggcagggaccctgcctatcttgctgctgcttctcgcc 
ggagccagttacttcttgtggagacaacagaaggaaataactgctctgtccagtgagatagaaagtgagc 
aagagatgaaagaaatgggatatgctgcaacagagcgggaaataagcctaagagagagcctccaggagga 
actcaagaggaaaaaatccagtacttgactcgtggagaggagtcttcgtccgataccaataagtcagcct 
gatgctctaatggaaaaatggccctcttcaagcctggtgaggaaatgcttcagatgaggctccaccttgt 
taaataaattggatgtatggaaaaatagactgcagaaaaggggaactcatttagctcacgagtggtcgag 
tgaagattgaaaattaacctctgagggccagcacagcagctcatgcctgtaatcctagcactttggaagg 
ctgaggagggcggatcacaaggtcaggagatcaagaccatcctggctaacacggtgaaaccccgtctcta 
ctaaaaatacaaaaaataaaaaattagccgggcatggtgacgggcacctgtagtcccagctactcgggag 
gctgaggcaggagaatggcatgaacccggaaggcagagcttgcagtgagccgagatcacgccactgcact 
ccagcctgggagacagagcgagactctgtctcaagaaaaaaaaaaaaaaaaaaaaa 
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>CDNA4 4 



ctgaagcttgcatgcctgcaggtcgacccacgcgtccgcggacgcgtgggcggacgcgtgggtttttcct 

ttcttccagaaggagatttaaccatagtagaaagaatggagaactattaactgccttccttctgtgggct 

gtgattttcagaggggaatgctaagaggtgattttcaatgttgggactcaaaggtgaagacactgaagga 

cagaatttttggcagaggaaagatcttcttcggtcaccatacttgagttagctctagggaagtggaggtt 

tccatttggaattctatagcttcttccaggtcatagtgtctgccccccaccttccagtatctcctgatat 

gcagcatgaatgaaaatggcaagtttcctggccttccttctgctcaactttcgtgtctgcctccttttqc 

ttcagctgctcatgcctcactcagctcagttttctgtgcttggaccctctgggcccatcctggccatqat 

gggtgaagacgctgatctgccctgtcacctgttcccgaccatgagtgcagagaccatggagctgaaatqa 

gtgagttccagcctaaggcaggtggtgaacgtgtatgcagatggaaaggaagtggaagacaggcagagtg 

caccgtatcgagggagaacttcgattctgcgggatggcatcactgcagggaaggrtgctctccgaataca 

caacgtcacagcctctgacagtggaaagtacttgtgttatttccaagatggtgarttctatgaaaaagcc 

ctggtggagctgaaggttgcagcactgggttctgatcttcacgttgatgtgaagggttacaaggatggag 

ggatccatctggagtgcaggtccactggctggtacccccaaccccaaatacagtggagcaacaacaaqqq 

agagaacatcccgactgtggaagcacctgtggttgcagacggagtgggcctgtatgcagtagcagcatct 

gtgatcatgagaggcagctctggggagggtgtatcctgtaccatcagaagttccctcctcgqcctqqaaa 

agacagccagcatttccatcgcagaccccttcttcaggagcgcccagaggtggatcgccgccctgScacg 

gaccctgcctgtcttgctgctgcttcttgggggagccggttacttcctgtggcaacagcaggaggaaaaa 

aagactcagttcagaaagaaaaagagagagcaagagttgagagaaatggcatggagcacaatgaagcaaq 

aacaaagcacaagagtgaagctcctggaggaactcagatggagaagtatccagtatgcatctcggggaqa 

gagacattcagcctataatgaatggaaaaaggccctcttcaagcctgcggatgtgattctggatccaaaa 

acagcaaaccccatcctccttgtttctgaggaccagaggagtgtgcagcgtgccaaggagccccaggatc 

tgccagacaaccctgagagatttaattggcattattgtgttctcggctgtgagagcttcatatcagggag 

acattactgggaggtggaggtaggggacaggaaagagtggcatataggggtgtgcagtaagaatgtgcag 

agaaaaggctgggtcaaaatgacacctgagaatggattctggactatggggctgactgatgggaataagt 

atcggactctaactgagcccagaaccaacctgaaacttcctaagccccctaagaaagtgggggtcttcct 

ggactatgagactggagatatctcattctacaatgctgtggatggatcgcatattcatactttcctggac 

gtctccttctctgaggctctatatcctgttttcagaattttgaccttggagcccacggccctgagtattt 

gtccagcgtgaaaagaagaagagagttcctccaattctgaccgagtgctgatcattccctagagacacca 

gtaaccccgggcttagctaacgaaagtggggagcctcaggctgaagtaacttttctctgcttctccctgc 

ccagctcagagctgagggcctccccctccacagcaaccaatcacaaccataaagctacaagcacgcactq 

aagcactttactgatactcattcaattattcatatgacagttgtttgagtttggtaccatcttattttcc 

ccttatacagataaggaaactggggtgcagaaaagtgaattgactacaaagtagacatgactagttaaca 

acacagctgggatctaaacagcaataactaacattaatggagaacttaaaatgctctgagtgctgtqtta 

tgagctttggtggatgtcactcctttaatcctcgcaacaccctgtcgggtagtctcatttagcaagtatq 

gaagttgaggcagggcaacattaagcaacttacataactcatgcagtaatttctgcagttgqqaqatqtt 

cagcttcagtccccggccctatggccgttcttttccaccctgtttcttcccccataggaagaacccacct 

gtagccctgaggttcttttcccaggatggctccaggataaggatcactgtaggtggttgtggaqttqaca 

cccctgttgactccttcccagctgattgtcagagccttagacccagcacgccttggattagctttgcaga 

gtgtcttggttgagagaataacctcaccgtacccacatgacacgtgatttggaaagagactagaggccac 

acttgataaatcatggggaacagatgtgttccacccaacaaatgtgataagtgatcatgcagccagagcc 

agccttccttcaatcaaggtttccaggcagagcaaataccctagagattttctgtgatataggaaatttq 

gatgaagggagctagaagaaatacagggatttttttttttttttaagatggagtcttactctgttgctaa 

gctggagtgcagtggtgcgatctcagctccctgcaacctccacctcctgggttcaaacaattctcctqcc 

tcagcctcccgagtactgggaatataggtgcacgccaccacacccaacaaatttttgtacttttaqtaca 

gatgagggttcactatgttggccaggatggtctcgatctcttgacctcatgatccacccacctcgqtctc 

ccaaagtgctgggattacaggcttgagccaccgggtgaccggcttacagggatatttttaatcccqttat 

ggactctgtctccaggagaggggtctatccacccctgctcattggtggatgttaaaccaatattcctttc 

aactgctgcctgctagggaaaaactactcctcattatcatcattattattgctctccactqtatcccctc 

tacctggcatgtgcttgtcaagttctagttgttcaataaatttgttaataatgctgaaaaaaaaaaaaaa 
aaaaaaaaaaaaaaaaaaaaaa 



>CDNA32 



agagaacaggtcccagataccgagtccgcaaccccaaacatcgcgattaataggaggcctctqqtctctq 
cctgccctgggtgctcatggaaccagctgctgctctgcacttctccctgccagcctccctcctcctcctc 
ctgctcctcctccttctcagcctgtgtgcactggtctcagcccagtttactgtcgtggggccagctaatc 
ccatcctggccatggtgggagaaaacactacattacgctgccatctgtcacccgagaaaaatgctgagga 
catggaggtgcggtggttccggtctcagttctcccccgcagtgtttgtgtataagggtgggalagaglga 
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acagaggagcagatggaggagtaccggggaagaatcacctttgtgagcaaagacatcaacaggggcagcg 

tggccctggtcatacataacgtcacagcccaggagaatgggatctaccgctgttacttccaagaaggcag 

gtcctacgatgaggccatcctacgcctcgtggtggcaggccttgggtctaagcccctcattgaaatcaag 

gcccaagaggatgggagcatctggctggagtgcatatctggagggtggtacccagagcccctcacagtgt 

ggagggacccctacggtgaggttgtgcccgccctgaaggaggtttccatcgctgatgctgacggcctctt 

catggtcaccacagctgtgatcatcagagacaagtatgtgaggaatgtgtcctgctctgtcaacaacacc 

ctgctcggccaggagaaggaaactgtcatttttattccagaatcctttatgcccagcgcatctccctgga 

tggtggccctagctgtcatcctgaccgcatctccctggatggtgtccatgactgtcatcctggctgtttt 

catcatcttcatggctgtcagcatctgttgcatcaagaaacttcaaagggaaaaaaagattctgtcaggg 

gaaaagaaagttgaacaagaggaaaaagaaattgcacagcaacttcaagaagaattgcgatggagaagaa 

cattcttacatgctgctgatgtggtcctggatccagacaccgctcatcccgagctcttcctgtcagagga 

ccggagaagtgtgaggcggggcccctacaggcagagagtgcctgacaacccagagagattcgacagtcag 

ccttgtgtcctgggatgggagagcttcgcctcagggaaacattactgggaggtggaggtggaaaacgtga 

tggtgtggactgtgggggtctgcagacacagtgttgagaggaaaggggaggtcctgctgattcctcagaa 

tggcttctggaccctggagatgtttggaaaccaataccgggccctgtcctcccctgagaggattctccct 

ttgaaggagtccctttgccgggtgggcgtcttcctggactatgaagctggagatgtctccttctacaaca 

tgagggacagatcacacatctacacatgtccccgttcagcctttactgtgcctgtgaggcccttcttcag 

gttagggtctgatgacagccccatcttcatctgccctgcactcacaggagccagtggggtcatggtgcct 

gaagagggcctgaaacttcacagagtggggacccaccagagcctatagaatcaattccttggactcacag 

ccatgcagataagccctggccatctcagcagccaccgcacaacccccctaatgaaagacacgccctcctc 

ccctctggtcacgtaagagaacatcttccagctgcctttttcacacccactccagccctctgccccagtt 

ttctcctcctcactagtctgtggctttagtagttcctttgcttgtaattatgggatgggatccaggcata 

gggaactagttgtttcatagctcccagtcaaaaagaaagtgagagaagctgttgggcagtgaacctactg 

tttaaaatcaggataaccacattaagcccaatatgccagttggcaccagatgctgtggacttggaatgag 

gccaacagggttcaccaggatgagagaggagagaggaatccacaggaccaccagaagggagagggaacca 

gatatgcagatcagagatagaggaagtggaaccagagagctgggagggaccaaggttgtaaggatggcta 

agtcccaccataagagctaaagggtcctgggagatgatggctcatttccacccaaccccaggatttccac 

agcacacacccacaggcctggacctgggatgaagatgaatgaagaacatggactcatgtggatgtggttt 

ggctcagatgtccctgcaataaacaaggggtcagtacttagtccctgagtgtggttgaggtttgaggtcc 

tggtcgagcagggcagtactggaccaggtctacgtcagcattcaggttcaatggggacaccagtggcttc 

aaacttcctgatctaattatgtttttagacacttagaagttattgaggactttaaagagcttttgtttat 

ttgggttaatatttatgacatttgacattgaaacaaaaatttaaaatgttatcttttaatttatgttaaa 

atagcattaataaatcagttataggttaatgtagataggatgttttgtgaaaaagcaatctattgtgtcc 

aaataaaaaaaacaaaaagtgtgacactggttaactttttccagatctcatgtctggcttaataagagat 

atttgtattatcatatctgcctttgtattaaacctattggtatatcataggtcatgttagctcaaaaaaa 

ctttactgcacactactgagagaatgagatgaaaaacgattaatgtttcattattattattgtgaaaata 

ttattaacactggggactccttaagagtacatcagagttctctctaggaatcccaaaaccacattttgaa 

actagaatagtggatcctggaagttaatccatgtgctggttaattttagatgtcaacctggtgtttccag 

aagagattggcaagtgagtcagtgggaaattctctccttctgttggctgggtgcccaatacaacaaaaag 

gcagaggaaaggcaaattcttctctcctctggagctgagacactcttcttcttctgcccttggacatcag 

aactcctggctctccggcctttgaacttcaggacttgtaccaggaggccctgggttctcaggcctttggc 

tttggactgagagttacacaatcagcttccctggttctgaggctttcagacttaaactgagccatgctac 

cagcatcccagggtctccagcctacagatgagctgttgtgcgatttcttagcctccataatcacatgagc 

caatctccttaataaatgcctgctcatagatctgtatctacatctatatctgtatgtgcatctatatcta 

tgcctatatctatatctatatcatattgattttgtctctctggagaaccctgactaataaaatgagacat 
ctaaaaaaaaaaaaaaaa 



CDNA27> 

gacccacgcgtccgaaaagctatggcctcaaccaccagcaccaagaagatgatggaggaagccacctgct 
ccatctgcctgagcctgatgacgaacccagtaagcatcaactgtggacacagctactgccacttgtgtat 
aacagacttctttaaaaacccaagccaaaagcaactgaggcaggagacattctgctgtccccagtgtcgg 
gctccatttcatatggatagcctccgacccaacaagcagctgggaagcctcattgaagccctcaaagaga 
cggatcaagaaatgtcatgtgaggaacacggagagcagttccacctgttctgcgaagacgaggggcagct 
catctgctggcgctgtgagcgggcaccacagcacaaagggcacaccacagctcttgttgaagacgtatgc 
cagggctacaaggaaaagctccagaaagctgtgacaaaactgaagcaacttgaagacagatgtacggagc 
agaagctgtccacagcaatgcgaataactaaatggaaagagaaggtacagattcagagacaaaaaatccg 
gtctgactttaagaatctccagtgtttcctacatgaggaagagaagtcttatctctggaggctggagaaa 
gaagaacaacagactctgagtagactgagggactatgaggctggtctggggctgaagagcaatgaactca 
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agagccacatcctggaactggaggaaaaatgtcagggctcagcccagaaattgctgcagaatgtgaatga 

cactttgagcaggagttgggctgtgaagctggaaacatcagaggctgtctccttggaacttcatactatg 

tgcaatgtttccaagctttacttcgatgtgaagaaaatgttaaggagtcatcaagttagtgtgactctgg 

atccagatacagctcatcacgaactaattctctctgaggatcggagacaagtgactcgtggatacaccca 

ggagaatcaggacacatcttccaggagatttactgccttcccctgtgtcttgggttgtgaaggcttcacc 

tcaggaagacgttactttgaagtggatgttggcgaaggaaccggatgggatttaggagtttgtatggaaa 

atgtgcagaggggcactggcatgaagcaagagcctcagtctggattctggaccctcaggctgtgcaaaaa 

gaaaggctatgtagcacttacttctcccccaacttcccttcatctgcatgagcagcccctgcttgtggga 

atttttctggactatgaggccggagttgtatccttttataacgggaatactggctgccacatctttactt 

tcccgaaggcttccttctctgatactctccggccctatttccaggtttatcaatattctcctttgtttct 

gcctcccccaggtgactaaggaaaagagcagaagctccttggtttaaccagcacagagaaaataatataa 

atcccataagggcagacgtttggtctgttttcttcgctgtcatttccttagtagttagactagtgctgag 

attttagtggatatataattgatttatgttgaatatatggacttagcaactaaaaataccacagatggtt 

aacctggactggggcaaagcaagataatagtgatgatcgtatgttgctgtctccatccgtctttaatggg 

tcagggctttgatttccaagggtcttcaggtgatgagtaggggtacccacaagtcagaaggtctgcgttc 

tcctagtttgtttgctgccatttgaactcatgtagggaatgaaagaaagctgcaattatccgccaactgc 

atttaaaacaaaacaaaacagaaaaatcaaaataacattgactcttccaaccactgacatgttgtttaat 

aatctaagcggcagtcctggaggctaccagacttactgagttctacctgagaaacagccaagcaaagtgt 

gagagaagggttaagactggcttacaatgagatgcttcaaatgaaaagggaattatgagtaaaattgaac 

tttgatgggggattcagttctggaaaagaatttggtattttccagtctgctaggaccaattaccttgaaa 

tattttaaaatctcagtaaatagttattgctgaaatggctgttggcagttcttattatgattcagagaag 

agcaaatagaccttaacttcattttgaaaaagaccaaattaccatacccgagtgagtaatgacaggacta 

caactaaaacataaacaacattaatgatgaccataaaaagtcacaaaattgctaaatgttataatttaga 

gttgacataaaaattgatggccaggcatggtggctcacgcctgtaatcccagaactatgtgaggctgagg 

caggtggatcacttgaggtcaggagttcaacaccagcctggccaacatggtgaaaccctgtctctactaa 

aaatacaaaaattagccgggcatggtggtaggggcctgtaacccagctactcgtgaggccaaggcaggag 

aattgcttgagcctgcagcagctgcagtaagccaagatcatgctgtgcctcaaggaaaaaaaaaattaat 

gtttactgatatttgttgaagtcctacaacatcacctctgagaataggagaaatgaagcaacagttgtgt 

ctagatgtcagaggcatggctgggcctccatctctgcctaagggagatataaaagagttcaaactattgc 

ccatgttccccagggtcagaagttctaattatgatgatagaggctgggttgtaagtagtaagtgaagggt 

agcagaatatgccatctttggcataagaagtattttgagttgaagacaattgagaaaaaaaaaaaaaaaa 
aa 



>CDNA22B 

ggacagaaaactccctccttttccaagttagccttatagtctagggcttaaaatactggtttaatggtga 
aggtaagtgcttttcttctttttgggtagaaggattattactaacttaccaaaggtccattaaggggagg 
gaacagttttaggagaagtcagagaaaagacattaacagcaacataaggatctccatctggtaatattgc 
ctaattccaaaatgaagagactctctgaaaaagataactgattcaatgaagaccctagggcaaggcttga 
gaagccactggtaccaatggacactgtggacaatggtcatttctccaaggacgctataaaagactgtcgt 
agtaaaagagattcagggcacagggaaactccaccacaaagcgtggtaccatttcccacagaagctaaat 
ggacgggaagcctgccaccaggaaaggtccagatttctgttcattacgctatgggctggctcttatcatg 
cacttctcaaacttcaccatgataacgcagcgtgtgagtctgagcattgcgatcatcgccatggtgaaca 
ccactcagcagcaaggtctatctaatgcctccactgaggggcctgttgcagatgccttcaataactccag 
catatccatcaaggaatttgatacaaaggcctctgtgtatcaatggagcccagaaactcagggtatcatc 
tttagctccatcaactatgggataatactgactctgatcccaagtggatatttagcagggatatttggaq 
caaaaaaaatgcttggtgctggtttgctgatctcttcccttctcaccctctttacaccactggctgctga 
cttcggagtgattttggtcatcatggttcggacagtccagggcatggcccagggaatggcatggacaggt 
cagtttactatttgggcaaagtgggctcctccacttgaacgaagcaagctcaccaccattgcaggatcag 
ggtcagcatttggatccttcatcatcctctgtgtggggggactaatctcacaggccttgagctggccttt 
tatcttctacatctttggtagcactggctgtgtctgctgtctcctatggttcacagtgatttatgatgac 
cccatgcatcacccgtgcataagtgttagggaaaaggagcacatcctgtcctcactggctcaacagccca 
gttctcctggacgagctgtccccataaaggcgatggtcacatgcctaccactttgggccattttcctggg 
ttttttcagccatttctggttatgcaccatcatcctaacatacctaccaacgtatatcagtactctgctc 
catgttaacatcagagatagtggagttctgtcctccctgccttttattgctgctgcaagctgtacaattt 
taggaggtcagctggcagatttccttttgtccaggaatcttctcagattgatcactgtgcgaaagctctt 
ttcatctcttgatatgcaagtttcctcatgggaatctcaaggggatttgggctcatcgcaggaatcatct 
cttccactgccactggattcctcatcagtcaggattttgagtctggttggaggaatgtctttttcctgtc 
tgctgcagtcaacatgtttggcctggtcttttacctcacgtttggacaagcagaacttcaagactgggcc 
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aaagagaggacccttacccgcctctgaggacataaagttacaaacttaaatgtggtactqaqcataaact 

ttagtcttccctggggagcctttataagacactgatacttgggacccactccagagattctgaatqaatt 
ggtctggggtggaacccagatactactaatttttagatactccttagaggtttctagcatgcg?2gggg 
ttgacaacagctggacaaacttgaaaagtcaattcatgtggcctttgaattttcctlattgglaaSIc? 
aaataaataaaaattcatgtgaaaatgatcactgataaatatcttcatggtggggcaggt?lttgga?gc 
agagaagatctgctcg g aattgtagccatatgttacagatctcagcacc|atSgaaI?gtaaagc?a?a 

ttctattaaaatttacatgctaaaataaaaaaaaaaaaaaa 



>CDNA22E 

acgcgtccgcccacgcgtccgcccacgcgtccggtcggggccagagcgcaggtgtacctggcggccgtgc 
tggagcacctgaccgccgagatcctggagctggctggcaacccggcccgcgacaagaagacccgcatcat 
cctgcgccacctgtagctggccattcgcaacggcgaggagcttaacaagc?gct g ggcgaag?c"ca?c 
gcgcagggcggtgtcctgcccaacattcagggcgtgcttctgccccagaagaccaagagcclccacaaaa 
ccaagggtgaaaaccattcactaggagaggagaaacacaatggccaccaagacagagttgaqtcccacaa 
^ aa ? gga H a ? Caa H aaCgCa f aagatatgCaa 9 t gg at g a gacactgatccccaggaaaggtccaagtt? 
at g ttctgctcgctatggaatagccctcgtcttacatttctgcaatttcacaacgatagcacaaaa?gtc 
atcatgaacatcaccatggtagccatggtcaacagcacaagccctcaatcccagctcaatgattcctcta 
aggtgctgcctgttgactcatttggtggcctaagtaaagccccaaagagtcttcctgcaaagtcctcaat 
acttgggggtcagtttgcaatttgggaaaagtggggccctccacaagaacgaagcagactctgcagcatt 
gctttatcaggaatgttactgggatgctttactgccatcctcataggtggcttcattaqtqaaaccctta 
ggtggccctttgtcttctatatctttggaggtgttggctgtgtctgctgccttctctggtttgttgtqat 
ttatgatgaccccttttcctatccatggataagcacctcagaaaaagaatacatcata?cct?ct?gaaa 
f aa ^ aggtcgggt f" ctaa 5 ca g cctctt c ( =catcaaagctatgctcagatctctacccatttggtcca 
^^ g " taggC tgt " ca g c catcaatggttagttagcacaatg g ttgtatacataccaacttacatcag 
ctctgtgtaccatgttaacatcagagacaatggacttctatctgcccttccttttattgttqcctqqatc 
ataggcatggtgggaggctatctggcagatttccttctaaccaaaaagtttagactcatcactqtqaqqa 
aaattgccacaattttaggaagtctcccctcttcagcactcattgtgtctctgccttacctcal^^gg 




g ca ^ c tgtcattgtacccactgtcagcggatttcttcttagtcaggaccctgagtttgggtqqaqqaa 

tgtcttcttcttgctgtttgccgttaacctgttaggactactcttctacctcaLfttgglgalgSgat 

gtccaagaatgggctaaagagagaaaactcactcgtttatgaagttatcccaccttggatggaaaagtca 
"^^tattgcataaaatagaaggcttccg^ 

gtggctcttttcaattatgagatcagttcattattttattcagacttttttttgagagaaatgtaagatq 
aataaaaattcaaataaaatgataactaagaaaaaaaaaaaaaaa aatgtaagacg 
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CACACACACA CACACACACA CACACACACA CACACAAATG AGGTATATAA AGGGTCTCCT 
AAAATGTCAT CTGATATTTG TTATTTCATA TTCTCAGATT TTTAATCCAT TTAGGTAGGT 
CTATTTTAGA TAGCCTTGTC TGAAACAGAG CTGGGACCTG ATGAGTGAAA ATGAGCTCAC 
CAGAAGAAAA ATCAAACAGG CATTTCAGAG ATTGAGGCCA AGAAGTTAAA TGTCTTAAAT 
GGGCAGAGCT TAGCTGCTTG ATGTGAAAAG AGACCAGCGT GGCTGGAACA GCAAAGGAGA 
ACAGCAGAAG AGGTGAACAG AGGCCAGAGA TGGTCACTGA GTGGGCCCTT AAGTCATGGT 
AAGGAGTATG GAGAATGAAT TATTGCATGT ATTGAATATG TAGGTGACGT GACTCACAGA 
TACTTTGGAT TTGTAGAGAT GAAGGAAATG TAGCAAGTGA CACTCTTAGA ATGTTGATTT 
GAGTAAATGG TAGTGTCAGT TATTGAACTG GGGAGAACTG GAAGGGATAA CAGGCTTAAG 
GAGCACGTTT ATTCCTGTGT CTTGGAAGTG TTTAGGGTGA AAGACCTATT AGAGTTCTAA 
ATGGAGATGT CAAGTGAAAA TGTGGCTACA CACATTTGCA TTTCAGAAAA AAGGTCAGGC 
TGGAGATGTA AAATTGGAAG TTTACTGCAT ATAGATAGTC TTTGGAACCG TAGTATTGAT 
GAAGCCATTA ATGAGACAGA ACAAAGACTA GGGAGCAGAG CCAAGCTCCA AGTTTCTAAA 
ATTTAGAGGA TAGTATAGTC TGGTCATTTT GAGGTGAATA CTTAATAACA GAACAATTTG 
TTGAAGTGTA AATTTAGAGC CCTACACTTT TAGCTCTGAC TATTAACGAA TACAGGAAAG 
AATGGATATG GTTATCTGCC TGGTGTCTGT GAAATAATTT AAGCCAGGAA GAGATCCTCA 
CCAGAAACTG ACTATGCTGG CAACTTGGAT CTTAGATTTC CAGCCTGCAG AATTGTTAGA 
AAATAAATGT CTATCGTTTA AG CCACC AGT CTGTAGTATT TTGTTATGGC AGTCCAAGCT 
GACTAAGTTT TGGTACCCAG GCGTGGGATG CTGCAACAAC AAATACCTAA ACATGGGGAA 
GTGGCTTTGG AAATTGGTGA TGGGTAAAGG CTGGAAGAGT TTGAGGTTCA TACTAGAAAA 
AGCCAATTGT GAAGGGACTA TTGAAAGAAA TATGGACATT AAAGGCAATT CTGGCAAAGG 
CTCAGAAAGG AAGAGAGCTG GACAGAAAGC TTCCATTTTC ATAGAAACTT AGATTTATAA 
CGATCATGGA TAGAATATTA AATATGCTGG TTAAAATATG GACTTTAGGC CAGGCGTGGT 
GGCTCACGCC TGTAATCTCA GCACTTTGGG AGGCTGAGGG CACAGATCAC GAGGTCGGGA 
GTTTGAGACC AGCCTGGCCA ATATGGCGAA ACCCTGTCTC TACTAAAAAT ACAAAAATTA 
GCTGGGCATG GTGATGTGCT TCTGTGGTCC CAGCTACTCG GGAGGCTGAG GCTGAAGAAT 
CGCTTAAACC CGGGGGGTGG AGGTTGCAGT GACCCAAGAT CACACCACTG CACTCCAGCC 
TGGGATACAG AGCAGGACTC CACTCCCCCC GCCACACACA CACAAAAAAT ATATATATAT 
GGACATTAAA GTCAACTCTT GTGAGGTCTC AGATGAAAAT GAGGGACAGG TTATTGGAAA 
CTGTAGAAAT CACTGTTCTT GTTACAATGT GTCAAGAACT TGGCTGAATT ACGCTGTAGT 
GTTTACTGGA AAGAACTTAT AAGCAGTAAA ACTGGATATT TACCAGAAGA GATGTCTAAG 
1861 CAAAGTATTG AAGGTGTGAT TTAGGTCCTC CTTACTGCTT AAAGTGAAAT GTGAGAGGAA 
AGAGCCGAAA TAAAGAAGGA ATTTTTAAGC AAAACACAAT CAGAACTTGG AGATTTGGGA 
TAGATTTCTC AATCTATATT GTAAAAATTG AGAAAGTTTT TCTTGAAGAG GTATGGTTGA 
ACAATGTTTT CTTTTTCTTT TTTTTTCTTG GTTTTATTTT TATTTTTATG TTTTTTGAGA 
210X CAGGGTCTGG CTATGTCATC CAGGCTGGAG TGCAGTGGCA CAATCTCAGT TCAGTGCAAC 
CTTTGCCTTC AGGCTCAAGC AATCCTCCCA CCTCAGCCTC CTAAGTAGCT GGGACTACAT 
GTATGCACCA CCACACCCTG GCTAATTTTT TGTTGTTGTT TATAGAGATG GGGTTTTGAC 
ATGTTGCCTA GGCTGGTCTC TAACTCCTGA GCTCAAGTGA TCTGCCCTCC TCAGTCTCCC 
AAAGTGTTGG GATTACAGGC GTGAAACACT GAGCCTAGCC TGAACAACCA TTTGATAAAG 
AGATAATGGG TGTGACCCAA GGATTTAATC AGCCATCTCA GCAGAAGCCA GGAAGAGAGA 
TGGGATTATT CCAGCAGAGA CACTGCCAAT TTAAACTAAC GTAGGCAGAG AAAACAGAAA 
GGAACAAAGG AAGGTTGTCG ACTTTTTGAA TTCTATAGAA CAGGATCATA GAGCTACCTG 
G CTGTC AATG TGTACTATTC TTTAAGAAAA GGAAAGACTG ACCCACCAAA GGCAACTTAC 
AAGATCACTA GGGCTGACTC TTTTGTTTTT TCTTGAGGCA GTCTCACTGT CACCCAGGCT 
2 701 GTAGGGCAAT GGTGTGATCT CAGCTCACTG CAATCTCCAC CTCCCAGGTT CAAGGGATTC 
2 761 TCTTGCCTTA GACTCCCAAG TAGCTGGGAT TACAGGCTCT AAATCTGTAC CCTCCCGAGT 
2821 AGCGCTCCTG CCACCACTTG CCCAGCTAAT TTTTGTATTT TTAGTAGAGA TGGGGTTTCA 
2 881 CTATGTTGGC CAGGCTAGTT TGGAACTCCT GACCTCCAGT GATCCATTCT CATTGGCCTC 

2 941 CCAAAGTGCT GGGATTACAG GCAGGAGCCG CCAGGGCTGC CACTTTGATG TCAGACTCAG 

3 001 AGAGTACAGA TGGGATAGGG TGGGGGTGGG AACATGTAGT CAAGGCTGAC TCTACCTGTT 
3 061 TCAAAGATGC CCTGCAGAAC TGTGTGGGAG TCTCTCACAG ATGGCTGCCT GGGTGGGACC 
3121 CCACCAAACT GAAAGACCGA GACTTCAGGC AGGGCAGATG GAGTAGGCCA ACTACAGAGC 
3181 CAGAGGTGAC ACTGAGACAC CACTGGGCCT GGAAATCAGG GCATCAAGCC AAAGAGGGTT 
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TTTCTTAAGA CCTAACAGAA TTTGCCTTGC CAGGTTTTGG ACTTGATTAG GACACATTAC 
ACCTTCCTTC TTTCCTATTT CTCCATTTTC TAATGGGAAT GTCTATTATG CCTGTTTCAC 
CATTGTACCT TAGAAG CATG TAACATTTCT GGTTTCACAC GTTCAAAGCT GGAAAGGAAT 
TTTGTCTCTG GATGAATCAC ACATTGAGCC TCACCCGTAA CCTGATTTAG ATGATTTTTT 
AGATGACACT TTGAACTTTA GAATTGATGC TAGAATGAGT TAAGACTTTC AGGGGGCTGT 
TGGGATGGAA TAATTTTTTT TTTTTTTTTG AGACGGAGTC TAGCTCTGTC GCCCAGGCTG 
GAGTGCAGTG GCACCATCTT GGCTCACTGC AAGCTCTGCC TCCCGGGTTT ATGCCATTCT 
,„ n CATGTCTCAG CCTCCAGAGT AGCTGGGACT ACAGGCGCCC GCCACCACGC CTGGCTAATT 
™r T ™ TTTAGTAGAG ATGGGGTTTC ACCGTGTTAG CCAGAATGGT CTCGATCTCT 
.7„l TGACCTTCTG ATCCGCCTGC CTTGGCTTCC CAAAGTGCTG GGATTACACG TGTGAGCCAC 
CATGCCCGGC TGGGATGGAA TAAATTTATC TTGTATGGGA GAAGGACATA CATTTTGGCA 
GGTCAAGGAC AGAATGTTAT GGACTAAACT GTGTCCCCCA AAATTCATTT ATTAAAACCC 
TAAACCCCAG TGTGACTGCA TTTGGACATA GAGCCTTTAG GGGGTACATA AAACTAAAGA 
TCACAGGATA GGGCCCTAAT CCCATTGGGG CTGGTGTCCT TACAGAAGAT GAGACACTTA 
ni?™ CTC TCCACGCAGG CACCAAGGAA ACACCATACA AACACACAGT GAGATGGCAG 
CCATCTGTTA GCCAGGAACA GATTCTCACC ATAAACTATG TTGGCACCTT GATCTTAAAC 
TTCCAGGCTC CAAAACTGTG AGAAAATGAA TTTCTGTTCC AAGCCTCTTA GATATGGAAA 
AAAAGATTCT GTTGTTTAAG CCATCCAGTC TCTGGTATTT TGTTATGGCA GCCTGAGTAG 
GCTAAGACAA TGAAGGATGT GGTAAAACTT TACGTCCCAA CCACATACCA AAGAGGCTGG 
AATTTAGCAT GCTTTCTTCT TTCAACTGTA GGCAATGTGC ACAAGTTCTA AATCCTAAGA 
CATGTTGGCT CCTTTACTCT GCCCAAACTA CAACTCAAAC AAACAACTGT AATATAATAA 
CATCCAATGA AGTTCTGACA TTTCTTCAAC ATGAGTACAG TAATTCAATG CCAGAGAATT 
CATTTTATTT TGAAATCTAC ATGC CAT ATT CCAATTTCTG TTGAAGATGC AATGGTTATA 
4621 TTTATTCTTT TTAATATAGA TTTATCAGAC TGGGCGCGGT GGCTCATACC TGTAATCCTA 
Tnl, ™ TTGAGA GGCTGAGGTG GGCATATCAC CTGAGGTCAG GAGTTTGAGA CCAGGCTGGC 
4 74 1 CAACATGGTG AAACCCTGTC TCTACTATAA ATATAAAAAT TAGCTGGGTG TGGTGGTGCA 
TGCCTGTAGT CCCAGTTACT AGGGAGGCTG AGGTAGAATT GCTTGAACCT GGGAGCAGGA 
GGTTGCAATG AGTGGAAATC GCACCAGTAC ACTCCAGCCT GGATGACAGA GCAAAATAAT 
AAATAAATAC ATAAAATAGA TTTATCAGTT TATCAATAAT ATAGTTTTCT TTTCTAGGTG 
TAAATATAGG TAATGACTGT CCTTTAGTAC ATTTTCTCAT GATGCTCCTC TTACTTGGTT 
TGGTACAATA TTAAGTATTG AAATAAAATA GAGAATCCTG TCGCTACACA TGAGCACTTA 
TTCCATTTGC TCATCTCCAA TATGCACGGG AAATTCTCAA ATTGCTAATA ATCTTGTAAC 
AC AC ATG CAT TATATTCAAC AGGAATATAT AAATTTATAA TTATAATTTA GGATCAACAG 
ATGACAAACC TTTAGAAGGT TTGTATTTAA CCTTAAAATA TAATTTTTTA AAAATTGGTT 
ATAAAATTTC TAATACTTTC TTTTTTGTGA CCTCAAGGGG AAAATATAAT TCTTATAAAA 
= Ann GTTCAAATGA TTTACAGAAT ACAAAAAGTG AATAGAGATG ATGAATGAAT TAAAGGAAAG 
5401 GATATTGCTA CATAGATTTG GAAATTTAAA AAGGGAAATT ACGATTGTTG ATTTTGTGTT 
54 61 AAACTGATCT GCTTTGTTCA AGATACCTTA TGTACCAAAA AATGATTTTA TCTCAGCCTC 
ATATCTCAGT AAATTCCTGA GACAAACTTT AGTCCCTGGT GCCCAGGTGC CTTTGGTAAT 
TGGGAGACCT CTAGGTTTAG CATCCTCATC CACTCGCCCC AATTTAAATA GTCCTCCCCA 
GGGCCATTCA GGCAAGGGAG ATGAAAACTT GCTCAAGAGT TGGAATCCAA CTGAAGCTAC 
CGAAATTCAT TGCTCAATAG ATAATTTTCC CTGGAAGTAA CTAGGGCTTT TGAATATAAT 
JSESSS* TCAAAGTAGA * GG TAAAGTA TTTTGGAGAT GAGGAGACAG GACAGAGCTA 
CGAGGAATGT CCTTTGCTTA GGGACTAGGC TCTTAGCAGT ACCTCTTAGG TAAGAACTGG 
TTAACTGGCA CCTTCTGTGT TTCTCTGAAG CTCCCTTTGC TTAGGGACTA GGCTCTTAGC 
AGTACCTCTT AGGTAAGAAC TGGTTAACTG ACACCTTCTA TGTGTCTGAA GCTCCCAGAA 
C AAACTGC CA GTGAAATTTG GATTTTTGGA ATATAGTTTC TTTTTTCTTG TTACTTTTTG 
TTTTGTTGTT TTTTTTTGAG AGTCTCACTC TCACTGCAAC CTCCCCCTCC TATATTCAAG 
TGATTCTCTT GCCTCAGCCT CCCGAGTAGC TGGGACTACA GGCGTGCACT AGCATGCCCA 
GCTAATTTTT GTATTTTTTA GTAGAGATGG GGTTGGTTTT TTTTTGAGAC GGAGTTTCAC 
TTTGTCGCCC AGGCTGGAGT GCAGTGGCAC GATCTTGGCT CACTACAACC TCCACCTCCC 
GGGGTTCAAG TGATTCTTCT GCCTCAGTCT CCTGAGTAGC TGGGACTACA GGCGCCTACA 
GGTGAACACC GCCACACCTG ACTAATTTGT GTAGTTTTAT TAGAGATGGG GTTTCGCCAT 
GTTGGCCAGG CTGGTCTCAA ACTCCTGACC TCAGGTGATC TACCCACCTC AGCCTCCCCA 
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AGTGCTGGGA TTACAGATGT GAGACACCAG ATCAGCCTCA GAAGACATTT TCTATTGGAA 
AGAGAAAACA CTATTAGCAA CCTATTAGTC TAATATTTAA TACTTAATGT CTTCCTTAGT 
AATAAACCAA CTCTCTACAA CAAAGTGCTT CCTGGCTGCC TAAGTCATTG ATTCATTCAG 
TTCAACATTT TCTCAATGCC CAACAGCCAA GTGTCTCTTG TATGCCAAGT TCTATGCTGA 
TTATCAGTAT TTGAATAAGA GGGGGTCTAC ATCTTAAGTA CTGCTTAAGA TGAAAGCCTC 
TAGGTTAACA AACTTAACAC AATGTATCAT TCACTACTAA ATAGACCGAA TACAAAATCT 
TGTTATTGGA GCCCAGAGAG AAGAATTGAA ATTCAAGTTT TCTCTCTCTC CTTTTCTCAC 
TCACCACAAT AAGTCAGTTG CACCAAGTCT TGTAGCTCTT TACTGAGCCA TGTTTTCACG 
TGTCCCTTTG TTTTATTTGC CACACCCTAA ATAAAAATTG TACTGGCTTT TTTTCCCTGG 
GTTTACAGTA TTAATACATT GTCAAGATTT ACCTCTTCGT GTAGATTCCC TGGGGAAAAT 
TACCTTTCCT CCTTCCCTTA AATTCTTCAG AG GTT AGAAA GCCATTAGTA ACATTCTGGT 
ATGTGGACAA AGTTTACCCA TTATGTATGG ATGTTTTACT CTTTCTATTT TTCTGACAAT 
AATCTCTTAA GGAGGTGTGG TTATAGAATA GTCAGCTGTT ATAAGTACTG TTTTCCTGGC 
CTTACAACTT AAGTTCTTTA AGCTGTTTCT TAGTTTGCTC ATCTCAAAAT TCGGAATAAG 
GATAAAACCT ATCTCTTAGA TTGTTGGATT AAATGAATTA ACATACTGGA AGCTCATGAA 
ATGTGCCTGG CACACAGTAG TGCCTAATAA ACCATCTCTC TTATTCAGCC TGTTTTCTGA 
TTTCAGAATC TACACTTGCT GAGCCAGGTT CTTTTCATTT CAAGGTGAGC AAAAGCATAC 
AAGGAAGAGA TGGAGGTAGG AAGAGATTAA GCCCTAGGCC AAGGTCACAC ACCGATTGGG 
AGCTGGAATC AAAGGCAATT TGGTCAGTGA ATAAAAAGGA TTCCAAGGCC CATAAGGCAA 
TTCTAACCTT AGGATCGAAA TTCTCGGACA TACAGGAAAT GCTGGGGGGG GAAAATCCGG 
7681 TCTTCTCAGC CCAAGAGCCA TGTGAAACCA GACCTTCAAA TCTGATGATT CTCAGCCCAG 
7741 CTGCCCATTA GAATCGTTGT AATTTAAAAA TACCCTCGGA AAATTCTAAT ATGTGGCTAT 
7801 CAAAGGTGAT CATTTGCTTT TATGCCACTT TGTTTTCACC CAAATGGGAC ATCCAACCCT 
7861 TTTCCTTTGA GAGTAGTTGT AGGGAAAGGA GGGGGTGGAG GGAGGGAAGA GCGGAAAAGG 
7921 CTGGATCCGC CCTGAGCCGG TGTCAGTATC TGGGAAGTGG GAGGCGCGTC AGCAGTAAAC 
7981 AGCTTCTGCT AGGATTATTA TCTCCTGCCA CACACTCGGA TTTGAAGGCT CCAAACGAAA 
8041 CAATGCAAAA CGCTTCAGTG GAGTTCCAGA AGCGTTAGAC TAAACGACTG GGTCTGTTTG 
8101 GCCAGTCTGA GCAGCTGGGC GCAGATGCAT AGGCAAGACT TAGCCCGCCT AGACTTTTCT 
8161 GCCCACTTAA TTCCGATCAA AGCAGAAACC GGCCGGGCGC GGTGGCTCAC GCCTGTAATC 
8221 CCAGCACTTT GGTAGGCAGA GGCTGGCGGA TCACCTGAGG TCAGGAGTTC GAGACCAGCC 
8281 CGGCTAACCT GGTGAAACTC CGTTTCTACT GGTGGCGGGC GCTTGTAATC CCATCTACTA 
GGGAGGCTGA GGCCGGAGAG TCGTCTGAAC CCGGGAGGCG GAGTTTGTAT GCAGTGAGCC 
GAGATCGCGC CACTGCATTC CAGCTTGGGC AACAGGAG C A AAACTCCGTT TCAAAAAAGC 
AAGCAAACAA ACAAAAAAAT GCAGAAACCG AGATCCGGAA GAAAACCTCG GCGAGATTCA 
CAGAATCCAG GAAAATAGGT CTCTAGAAAT TTGTCCATGG TCCCAGATCT CCATTTCTTG 
TGGGTGGGGC AGCTGTTACC AGATCCCTAG AAGCAAAGGT TTTTTTGGGG GACCGTGTCT 
8641 CACTGTTGCC CAGGCTGGAG GGCAGTGGCA CGATCTCGGC TTACTACAAC CTCCGCCTCC 
8701 CAGGCTCAAG CGACTCTCCT GCGTCAGCTT CAAGAGTAGC TGGGATTACA AGGTATGTGC 
8761 CACCACGCCC AACTTATTTT TTTATTTATT ATTTTTATTT AGTAGAGAGG TGTTTCACCA 
8821 TGTTGGCCAG GTTAGTGTCG AAGTCGTGAC CTCAGGTGAT CAGCCCCCTC GGCCTCCCAA 
8881 AGTGGTAGGA TTAGAGGGGT GAGCAGAAAG CAAAGGTTTT TGAGTGGCCA CAGGCCCCAC 
8941 TCTATTTCCT TTTCTGCCTG TAATGGCAAC CTAGACGCTT GAGCTTCTTA AAATACAAGA 
9001 GTAAGTTGCA TGTCAGGCAC CG TTCTAC AT TAGGGACATT AGTCTGTTTT ACAGACACCT 
9061 TTCAACTCCC TGGTTAACTT TTAGGTAATA TACTCTGCAC TTTAGCAGGA ATGGGACCTA 
9121 TAACTCTCAC AGAATTAGGA AAGTGAGGCT GCCTACAGCC TAAATTGAGA AAAAAATAGA 
9181 CGGGGGACTA GTCGGAGGAC CAAACAAGGT TACCAACACG TTAGAGTTTT GCCTTCAATT 
924 3 TACATTTTTA AAGTAATCAC AACGAAGTGT TTAGATCACG AGGCATCCCT GCATGTAAAC 
TGTTAGGCAC TAACTATGGT CGATCTTACA AAG C ATTAAC TAGAATATTT CTTTAGAGTA 
TGATAGTACG TAACTGACCT ACTATTACAT ACAAACAGAC CAACC TTTAG TAACAGCGCT 
CCCCAAAAAC CGAAAAGCAG TAATACGCTT TGCTCAAGGT TGGCATAAAA TTAACTTACC 
TTAGTGCCTT TTTTCCTTCT ACCTACAAGC AGTGAGGTTA GCTCTTCCTT TGAAACGGTA 
9541 GGGGGGCTCT GAAAAGAGCC TTTGGGTTTG ATAGCGTTTC CGGGAGCTCA GATACCTGTC 
96 01 AAATCACTTG CCCTTGGCCT TGTGGTGACT CTCGGTCTTC TTAGGCAGAA GCACGGCCTG 
GATGTTAGGA AGGACGCCGC CCTGAGCAAT GGTCACCCGG CCTAGCAGTT TGTTGAGCTC 
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9721 CTCGTCGTTG CGGATGGCCA GCTGCAAGTG GCGCGGGATG ATGCGAGTCT TCTTGTTGTC 
9781 GCGAGCCGCG TTGCCGGCCA GCTCCAGGAT CTCGGCGGTC AGGTACTCTA ACACCGCCGC 
9841 CAGGTACACC GGCGCGCCTG CCCCAACCCG CTCTGCGTAG TTGC CTTTAC GGAGCAGGCG 
9901 GTGCACTCGG CCCACCGGGA ACTGGAGACC AGCGCGAGAA GAGCGGGATT TCGCTTTGGC 
9961 GCGAGCTTTG CCTCCTTGCT TACCACGTCC AG ACATTG CA ATCAGACAAA AATCACCAAA 
10021 ACCAGCGGCC TAAGCTCACG AGAAAACAAA CAAAATCAAG AAATATGTAA AACATGGCCG 
10081 CTTTTATAGG TAGTTCCTGG GGAGTAAATC CGACTTTTTG ATTGGTCGGT AGCAAATGCT 
10141 AGTCAGATAG CCAATAGAAA AGCTGTACTT TCATACCTCA TTTGCATAGC TCTGCCCACG 
10201 GATGACAACT GTGCAGTTTG TCTTCCAATT AACTAAGAGG TACTCTCCAT CCCTCATTAG 
10261 CATAAAAGCC CTATAAGTAG CAGAAATCCG CTCTTTACTT TCGACACATT TCTGGTGTTT 
10321 TAAGATGCCT GAGCCAGCCA AGTCTG CTCC CGCCCCGAAG AAGGGCTCCA AGAAGGCAGT 
10381 GACCAAAGCG CAGAAGAAAG ATGGCAAGAA GCGCAAGCGC AGCCGCAAGG AGAGTTACTC 
10441 TGTGTACGTG TACAAGGTGC TGAAACAGGT CCATCCCGAC ACTGGCATCT CTTCCAAGGC 
10501 CATGGGCATC ATGAATTCTT TCGTTAACGA CATATTTGAG CGCATCGCGG GCGAGGCTTC 
10561 CCGCCTGGCG CATTACAACA AGCGCTCGAC CATCACCTCC AGGGAGATCC AGACGG CCGT 
GCGCCTGCTG CTTCCCGGAG AGCTGGCCAA GCACGCCGTG TCGGAGGGCA CCAAGGCCGT 
CACCAAGTAC ACCAGCTCCA AGTAAACATT CCAAGTAAGC GTCTTAACAC CTAACCCCAA 
AGGCTCTTTT AAGAGCCACC CAGATACCCA CTAAAAGAGC TGTGGCCAGA CGCCAAATTT 
10801 TATTTGGCGG CGGAGGGGTA TTAGAATATA GGAACTGGAG AGGGGTGGGG ACAAGTGTTG 
10861 CAGCTTAGAG AGGGACAAAG GGTCCTGAAC CCGAAAGAAG CCAGCCATTA AAAATGGCTT 
,nQ -" TGGGGTCAAT TCGTTGTGCT TAAATTTAAA ATGGAGACAA GCGGCCATTT TGCTAACTCG 
GCGTTCCCGG AAGAAACCGC AGGCTCGCTT AGGTTTCAGA CCCAGCTGTC TGTCCCTGTC 
TACGTCGCCA GGATCAACGG TTGCCGTAAT GTCATAATTT CGCCACCAGC TTCTAGCCAA 
11101 TAGGCTGTCC TGTCATTTTA AATATTAACC AATCGAGGGA AAGCTGTTTT GAGACTCTGA 
11161 TTTACATAGC GGACCGGAGT GGGAACCTGG GCAGTAACTG CCTAAGGAAG GACTCCCCCT 
11221 CTGTTTTCGT GGCGCACACC TTCGTAGTAT ACTGAAGGGT GTGTC TCCTG GGTTTCCAAC 
112 81 TGCCCCGGTA ATAGTCTTTT AACCTAATAT GCGTCAGTTT TGATAACAAC ACTAAGGCAG 
11341 TACAGAACTA AAGATGTAAG CACTGCGCCA GATGTTGCTT CATACATCTT ATTCTATTCA 
114 01 ACTGGTTTAT TCAAGATTCA AATCAAATCA AATTTTGCTT GAATCCCAGT GCTCAGTCAG 
11461 CCATAAATGG TGTGTTGCCT GATTGAAACT TAAAATCTCC GTAGGGGGCT TGTAACATGC 
11521 AGACAAGTTT GAAAGTTGCT TTAGGAGAAG CCAACTCTTA ACTGCTGGGT AAATTGACAA 
11581 G C CTTCGAAC ACTGAACTGA AGGCCAGTAA GGACTAGGCG CTGGGTGGGG GAGAATGAAG 
11641 AGGAGACGTC ATTAAACTTA GCACATACAC TGTATCTCCT AGAGGACTCT CCCTTCCTAG 
11701 ACAACTGCAG GCCGCTTTGT GGCCTGGGAA ATTCCACATT CCCTTAAGTA TTTTACTCAT 
11761 GGTCTTTTCC AGGTAAAGAT TTTAAGATGA AGGGTTAGAC GTAGTCTACC TATCTTTTTA 
11821 TTCAAGTCTA GAACACGTTT TTAGCACCTA GAAGTTTGCT TTCTCCATTA AAAACCGGGA 
11881 ATATACAATA AATAAAATTA GTGTTAAAGC AGATTTTTAC AAACTTAAAT ACCATGTAAT 
11941 TTAGGTTACA GTTATTTAAC ATAAGGACTG TGTGATCTTA AATCTGCAAT TTCTTTCACA 
12001 CCTGGGAAAT AAACTAAGGC CTGTCTTTGG TGCCAGACAA GGCCTTATAC TTGAACACTG 
12061 CTGTGCAATC ACAGGCTGCC TTGCCTAGAT AACTTATCTG AGAAATTCTG ATGAGAAATG 
12121 AAATTTCCAG AGTCCCTCAC AAGTAAATTT TTTTTTCTTT TTTTTTTTTT TTTTTGAGAC 
GAAGTTTCTC TCTTGTTTCC CAGGCTGGAG TGCAATGGCG CGATCTTGGC TCACAGCAAC 
CTCCGCCTCC CGGGTTCAAG CCATTCTCCT GCCTCAGCCT CCGGAGTAGC TGGGATTACA 
GGCATGCGCC ACGACACCCT GGCTAATTTT GTATTTTTAG TAGAGACGAG GTTTCTCCAT 
GTCGGTCAGG CTGGTCTCGA ACTCCGGACA TCAGGTGATC TGCCCGCCTT GGCCTCCCAA 
AGTCCTGGAT TACAGGCTTG AGCCACCGCG CCGGGCCTAA ATGGTTTTTT TTTTTTCTAT 
12481 GCCTCTAATG GACCTGGTCA CTTATTCCCA TTCAGACTGA CCGCTCTCCT ACCTGCCAAC 
12541 TAACTAATCA GTGTAACCAA AATCTGCAAA CAAAATTCAG TATTCTTTCC CCGCCTTTTC 
12601 CCCTTTCTCT TACATAGATT ATGTTTTTGC CTGTGTTAGA TGAAATAATT CTATTGCTTG 
12661 TTCTCTCTTC TGTACAAGTA CCCAGTAAGC AAATTATTAA CTTCTTGGTC ATTTATTTCT 
12721 GAATTTTCCA CCAAGACAGT GTTTATGTGA GTCATACAAT AAGAACCAAC AGAAATGTGT 
12781 GTCTTGGAAA CAGGTTGTCT ATCCCTGGAC CCTTTGAGTT TTCTGTTCAC TTTCCTTTGG 
12841 CTTTTGCATG CTAAAAGTTT ATCGTCCGCG TTTGTTTGTT TTGGTTATTC TAATTGGACT 
12 901 TGGCTGATTG GTTGCATATT GGTGGCAGTA GTAGAATTTG AATTCTGGTT TTCTGGTCAC 
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12961 



iSS S SSSSS GATGTTGATA SS^"™ °™™« TAACCTTTTT 
13081 TGTTTTTCTT TCTGACTTTA 7 CTCTGTGA GGACACAGGG TTAGAGTTGG 

13X41 CTTCCAAAAC TtoSSSJ £££££1 £22™* GCTTGTATGG CTCTTTcSc 

132 01 CTGTGTTAAG ATGATATgII 5££5£ T ™GCAAA ACCAGTATTC 
13261 AGTGTTAGGA CTAACAGGAG ACAAAAAGGA SSSSSI J™" 1 ™ ^CTTTAAGAA 
13321 AACTGCTATG GCAGAGGCTC TaSoctSS SJSSSSS ACCGAATGTC TGGTCTCAAT 

133 81 CCTTCACGTT CTTTAAGTAA GGTTaSggI Sn^HTJ AGTAATTTCA CATTATTGCC 
13441 ACTATTGAGT CAGGGAAAAA AAAGAGTGCT S?^^ 0 ATAAT <™T TACAAATTGG 
13501 ATTTTCTAAA CCTTAACGAG TTTATOctS SSJSS? GAATAAAACA AAGATTTAAT 
13561 AATTTTCTTC TAAACTGAGA AtSSSU ^CTGGAAACT AGGAAACTAG 
13621 GGACTTCTGA TCTTAATTAC ATACTTTTAT SSSSS CAGCAGTGGT ^CCACCTGAG 
13681 AACCTATGGC TCTGTTTTTA CCCACTtSS GATCAACATG CTAAATAGAT 
13741 TAATTGGCAA TAAGATTGAG AcSSSSi iSSS ^ ATTAGCAG GC3 TTAGCTTTCC 
13801 GGCCCAGGCT GGGGTGCAGT SSSSS ££££££ GAGACAGAA * TTTGCTCTGT 
13861 CTAGCAATTT TCCTGCCTCA GCCTCCCCAG CAACCTCTGG CTCCAGGGTT 
13921 TGGCTAATTT GTGCATTTTT AgSgaSSg TACAGGTGCA CCACCACGCC 
13981 GAACTCAGGT GATCCACCTC GGCcSSS? 221 CGCC ATGTTG GCCA AACTGGTCTC 
14041 GCCCAGAAAA GACTATcSa ££££££ SSS? TTACAGGCGT GAGCCACCGT 
14101 GAATTAATAA ATTATAATGT TTAAATAATT GTGAAATTAT CCACTTAAGG 
14161 CATCAATTTA USSSS ESS?" 0 TTACATAAAG ACTTAaJSJ 
14221 AAATGTGCTA CCTCTTTAAG SSSIJJS AAAAATTTTC CTTGTGCTTT 
14281 AGTGGTCTTA GTTAACAGCT TaSSSSJ ^a!^ PP™*™* GA «TTTCATT 
14341 AACTTAAAAA TATTAATACC TCTTTTAtS rZ^^ ATACTTCAGA ATTTTTAAAT 
14401 ATCTAATCAA GATTTTTTTT gScAAa"£g SSSJUi ATAAGGAAAA TATATAATAC 
14461 TTTATTCTTA TACTGTAAAA ATAATATTAG rSIJi! TTCATTTTAA AAATGGCTTC 
14521 TTCATATTCT AAAAAACAAA TAGTATACAG AAGTTTAGGG 
14 581 ACTAGTTGTA CTGGTTACAT GArJJIJ^ AATTTAACTT GCATTTACTA AATTTCTTCC 
14641 TATTCATTGA SSESSS SSSiiSJ ^f™" A ™™™A AATTGTAAAT 
14 701 TTTATGTTAC TAATTATAAG S^SSS* TTTTAAAAAT GGAATTAAAT 
14 761 GATAACTTTA AGAATTTAGG TfflESSS SSSS^ GTACTGAGTT CACAGTGTAT 
14821 GATACCTGGA CAATTTCTAA AtJSSSSJ 25X12" AAATTAA ™= TCAATCTTTG 
14881 TTTTATGCAA ATAACATTTT SSSSS AGMTianr^ ^CACAAGA AACAGTGTAG 
14941 CATATGATTG CCTTAGAATA 2SSS5S SSJSS^ TTGATAAACA GATAAGAGAA 
15001 ACTGTATACG TGTGGG CGT A GAGGaSSg SSZS?* TTTAGATT * G TAAATCACGT 
15061 CATGCCTATG CGGGAACACA SSSSSS SSST" ATGACTGCCT CTGTTTTCGT 
15121 CTCTGACTCT ATCAAGTACT SSSSS EES?* GCTATGGTTA ATCCAAACAG 
15181 CTACCTTGAG CCTTTACTTA TTtSI^S AGAAACACAA GTAAGCATTC GAGATAATGA 
15241 TTTACTATGG ATTCTCACTC SS^™ ^ G TTACTGTT TGTTAATGTG GTACATTCAA 
15301 ATGATTTATA tJSJSSS ££££££ SSSSS TTTCTTA ™ TTATAtSS 
15361 CTGGAACCTC CATTTTCAGT AcSSSSJ CTGACAACAT TATAACAATG 
15421 TGGATATGTG CTTCCCAGTG tSSSSJS CTTTTATACT TCAGAGCAGA 
15481 AAAAATACAG TTCTGAGATT CATTAA^gI ASSIST* CTGAGAAA T A CACTATCACT 
15541 TCTTCAAAGT CTACAGAGGA AgS^C SSSJS^ T TCTGGAAGTA GGAAGTTTCC 
15601 TGTGGTATTA TTCTGTTTTG TCCTTTTCTC JS^S^ AGCTTCTTGG TTCTTTTACC 
15661 GATCTGGCCC TCCCAAGTAT rSII^S^ CATTATCTG T CTTTCCAGTG ATGAAATTTT 
15721 TAAGATATTG GCATGCTAAC £££££ 

15781 AAAGTTCCTA AATAAGAATA TTTACTAgS 11151^** ACCTTTAT AACTTGACTA 
15841 GTCAAAATAA TCAATTAGGA SSSSSJ TGCCTGTGGC CCACATTTGA 

15901 TGACCAAACT GATCTTTGAG ACcISiSS SJSJSi AAGTTGACCA AACTGATCTT 
15961 AATTTGTACT TTAAGGAATT CTTATaISS CCAATTAAAT TCTTGGAGAC 

16021 CCCTACTTCT GTGCTTCTCT AATATGCAGA SSU!™ CCTCATAAG T TTTTTTTTTG 
16081 AAAAAACAAA AAACAAaIS CT^SSS ^"TGTTACAAA GCCATTGTCA 

— I TATTTTTACC AAAAATGGAG GAGTTGAAAA JSSSSZ SSSSS SS££ 
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16201 CCTACCTAAC ATGGAAATGT TGGTTGTCAG TGGAAAATAC TACACAGAGA TAG CC ATAGT 

16261 GCTGCACAGC CAATCTTAAG TGTTTCTAGA GAATCACTAA TTGTTTCTAG AGAATCACTA 

16321 ATTGTTTTCT TTTAACATTC TTGGTTTATA CAAGAAGAGA GTATCCATAC TAAACTCTTT 

163 81 TCTACTGAAA ATAATGTGCA AACATAACAT CCTATTCCTA GACAGTTTGT AGTTTTTTTC 

16441 TCCCATTTCT ATTTTATAAA TCATCTTTTT AAAATACTTT GTTGAGTGAA ATCAGTCCAT 

16501 TGCTTGATAT ACCTTGAGCA CAAGTAAATA GTATGCCAAA AATTAAATGT CTTTCAGTCA 

16561 CAGTTTGACA AACTCAACTA CCCTGAGCCT ATAGAGTGGT AATAATTGCC CTACTCATAA 

16621 AGATGGGGTG AAGATTAAAT GAAATAGCAC CTATAGAACA CTAGTTCCAG ACGTGGTATC 

16681 ATGCTAGTAA AATGGCTGCA CAGCACTGCT CAATGATGAC AAAAAGTGAA GCTTCTGGAG 

16741 ACAGACTCCA AGTTTGACTC CCAGATCACC ACATATAAGA TGTGGGACTC TGAGGCAGGT 

16801 CATTTAATCT CTCTGTGCAT TAGTATCCTT CTCTATACCT TTACAGTGAT GGTAATAGCA 

16861 CCTACCTTCT AGAAGTATGT GAAGATTAAA GATCCTTAAT GCATATAAAC CACTGTGTTT 

16921 ACTGCTGTTT GACAAATTTT ATTTATAACC ATCTTTACGC TCCTAAAAGG ACTTGAAGCA 

16981 GCTTATGACT GAAGACTTTG GTAGGAGTTG GCCTTCTATA AATTATAAGA ATTTCATAAA 

17041 TTATTTGATA TGAAAATGCC AGTTGATCAT AGTATGTTTA CCGGGGTCCA ACAGGTTGAG 

17101 AAAAAATACA CTTTTTTTCC CTGAACATAT GAAATTAGCT CTCTAGGCAT ATTCCTAAGG 

17161 ACTTAAAGAA TGATAACTAT CATTTCTCTT AAATCTTCCA GATTTGGAAG GATATATATA 

17221 TTCAGCACAT TGACAGACAA TCCCAGTAGT CCTAAATTAA AAGACATTAA AAATTAGTGA 

17281 AACTTTTCCT ACCTTTAGCC TGTGTAATCC TGGATGACCA AGCATAAAAT TAAATTGAGT 

17341 AGAGTATACC ACTGTAACAT TTCCTGAAAG GTATTCTAGG CTCTGAGTAA TTTCTTTGGG 

17401 G TCTGAAG AT CAGTTTGACA TATCCTCAAG TATCATGAGT TCATTATAAT TAAGAAAAAG 

17461 AGAGTAAATC TGGAGAATGA GCCACTTTCT TACTACTCCT TGACCTCAGT TCTTTTTTTC 

17521 AGAGACAGGG TCTCACTTTG TTGCCCAGGC TGCCAGGCTG GAGTGTAGTG GCGCAATCGC 

17581 ATCTCATTGT AACCTCCACC TTCTGGGCTG AAGCCATCCT CCTGCCTCAG CATCCTGAGT 

17641 ATCTGGAACC ACAGCAGGTG CACACCACCA TGCCAAGCTA ATTTTTTAAA AAG TTTTTTG 

17701 TAGAGATGGG GTCTTACTAT GTTGCCCAGG CTGGTCTCAA ACTCCTGGGC TTAAGTGATC 

17761 CTCCTGCCTC AGCCTCCCAA ATTGTTGGGA TTACTAGTGT GAGTCACTGT ACCCCGCCCC 

17821 ACTTCAGTTC TGAGGAGGAA AAAATATGTA ATAATAATGG GACTTTGGTT TGCTGATTTA 

17881 AAGATTCATG TAACCTTATC ATCCAATGCG CAATTTGTAG AATAATTAAT AGAGACATCT 

17941 GGTCTCATGT TTCTACAGTT GCTCATGCCT TGATAGTAGA TCTCCTTGCT GCTGGCTCAG 

18001 AAGGGTAAAA GAGCAGAAAT GATGGGGCTT CTCTCATTCT ATGAGGAAAT AGACCTATGT 

18061 AGAGGAGGCT ACCTGTGGTA AAACCTTATC CTCATCACTT AAAATTCTAG GCTTATTCTC 

18121 TGACCATATC AAGTTTTCAA ATGGTAAAAG AATTGGATTC AAGAGAAATA TGAATAAACT 

18181 TTTGTTTTCA CTTTTCTCCC TCCTCTCCCC CCATTCTCCC TTCCTTTATT TTCTTGTCCT 

18241 TAGTTTTCTT TTCACTTTTT TGTCTACTAT TATTTGCCCA AACTCAACTG TAGGCTAGAA 

18 301 CAAAAAAAAA TTGAAAATTA AAATGTGCCC CTTTTGTTGT TAGACTTGCT TAAACAATTG 

18 361 GGGTAATGAA CCTTGGACAC TAGATTTTAA AACACACACA TTTGAGCTTC AGTGCACTGA 

18421 AATAAATATA TTTTTAACAA TTAAAAAATA AAATTGCATG TTTAAAAAAT CTGCAGAGAA 

18481 CAATACACGT TGTGAGATCT TGAATGGAAG GAAAACTGCT AGCCTCAAGA GTGGATCAAA 

18541 GATGCTCAGC AGGCAACAGA GTAAGAGCAT GTTGGAGGGT TTAGAGAGTG TGCTCAGGGT 

18601 TCTAGGCTCT AAAAATCAGA CAGTCCCCAC GGCCTGGCCT TCGTCGCTGT ATCTTCTTTA 

18661 TGAAAAACAC TAAGTCTTTT TCCTCACTGG ATAAATTTTT ATCCTTCAAG TTTAGATCAA 
ATGGAACTTT AGGACACTGA CTAGGTTACA TTCATCTTTT AAGAGCGTAC AGACATTCAA 
GGGCTAGAGG ATGTGGGTTT ACTGCACAGG CTCATTATCC AACAGCTGTG CTACCTGGGA 
AACTTAACCT CTCTGTGCCT TAATTTCCTC ATCTATAACG CAGGGAGAAT GACAGTAGGT 
AT CTC ATAAG GTTGTTGGAA CAACTAAATG CATTGGTATC TATTGTGTAA AGTGCTTAAA 

18961 ACACTGCCTG G C AC AGAGC A AACATCCAGT GAACTTTAGC CATCATCATT ATCATTGTTC 

19021 TCAGAGTCAA ATACAATATC TCATATCTGA TAAATTACAG AAGTGAATCA ATCACTCTCT 

19081 CTCTTTTCTC CAGGGGGAGA CAACAGCTTT TAGACATATC TTTTCCAACA GTCGTCACTG 

19141 CTGGACACTG TTTCATCTTG CAAATAAACC AATGAAAATG AGTGATCCTA GAAGAAGATA 

19201 AATGGAGGTA TTTTGAACAA TCAAAGAAGG ACAAATGAAC ACCTGGCTGA GAAAAATTAG 

19261 CTCTTTTTTC TATGCATAAA ACTATTAAAA TATTCTTCAT AGAAATTTAT GACACAGGAA 

19321 ACATAAAGAC AAAATTAAAA TAACTCCTAG TATCTCCTAT TCTTTTTATA TGTATATTAT 

19381 ATATACTCAT ATTCATATAT ACATATATCT CACATCATGT ATCATATATA AAATAAATTT 
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AGGTGTCATG 
TATGGATATA 
TGCATTATAT 
TTATTTCCTA 
TCTAAGGTTC 
CCCAAAATAA 
TCACTGTTAA 
ACTCTATTAG 
AAACTGCACT 
AGAATAATAC 
CTCTGAAGGG 
TGTCTCTTAT 
TCCCCATCCC 
ACAGGACTTA 
ATAGGATTGG 
ATTTATTTAA 
AAGGCTATGT 
CAACAGAATC 
AAATTTTGTT 
AAG AC AG CAT 
TAAACTTGTC 
CCTCAATCCA 
GCCAAATGTC 
CCTCATCTTG 
AAACAAAAAT 
GCTTAATGAT 
GCACTGCTGA 
ATGCTGTATG 
TGCATCTATT 
ATGGGACAAC 
AGAGGTGCTA 
TACAGAAATT 
GCTAGGCTGT 
CTTCTATCAT 
CATGTCTAGG 
AGGCATGTAC 
TGCTTTTCTC 
TCTAATGTTT 
AAAATTATTT 
GTGTTGTTAG 
TCTTTACTTA 
GTGGAACAGA 
TCTTTCTTCT 
TTCCTCTCTC 
AGCAGAAACC 
CTATACTCTC 
AGCACATAAT 
AATTCTTTAG 
TTTTTCTGTG 
ACAACTTTTA 
TACTCAGTAG 
TTTGAACAGA 
ATAAAGTTCA 
TAAAAATGTT 



ATATATATTT 
TTGATAATTA 
TATAGATTAT 
AGGATAGACT 
TTAACATATA 
TATGGAAATT 
CCTAATAGTC 
TGAGGGTCAT 
GCAAAATGAT 
CTTAGGTTAA 
ATACTAAACT 
AAACATTATA 
CCCAAATTCA 
TAAGAGGTAA 
TGGCCTTATA 
AAGAAAAAAA 
GAGCTCTCAC 
CAGCCATGCT 
GTTTAAACCA 
CATTGCTGTC 
CAAGGTCACA 
AGGCCAGGAC 
CACACCCCAG 
AATAAATATG 
GCAAAGTATG 
ATCCTTATAG 
TAGACTGTAA 
TTTACTTTTT 
TCTTCAATGG 
TGCACATGAC 
CCCACTAAAC 
CACTTACAGT 
TTTGTTGGGG 
CCTGTGTTAA 
GGTCATATCT 
CATTTTAATG 
TGACTATTCT 
TTCTCTCCTT 
CAGTCATCCA 
CATTATACAT 
CATATAAGTA 
AATAAGATTA 
GGGTCAGGTA 
CTCTTAGACA 
TACTGAACAA 
TCAGTGATTT 
AATTGTTGTC 
TTTAGAGACC 
TCTCTCAGCC 
CAAAACATGG 
TTGAACCAAA 
GTTAAAGTTA 
AAAATAGTGC 
AGATATCAGG 



AGATAAATAT 
TGTATTTGTT 
ATAGCTCACA 
TCATGAAGTG 
CATTGCCAAA 
CCTGTTTTAT 
CTTCAAAAGA 
TCTTCCCATG 
AAACATGACA 
GGCCACATAA 
GCATTTAGCT 
ACTACTCTTT 
TATATTGAAG 
TTAAGGTTAA 
AGAAGAGGAA 
AAAAAGAGGA 
AGTGAGAAGG 
ATACCCTGCT 
CACAATCTAT 
ACTTACAGAC 
AAAGCCAGAA 
TCCTCCACTC 
AGTCAGCATT 
ATCTAACAAC 
TAGAAAACTA 
TCTTGGAGGG 
ATTGGTCCTA 
TTATGGAAAC 
GTATGCACAG 
AGTCAAAAAT 
TAATATTTGT 
GGGTTACCAG 
GCTGGCAGGA 
CCATCTTCCA 
ATGTTCCATG 
CACACCTTGG 
GTATTCTGGA 
GCTTTCAAAA 
GTAATGAGCT 
GTTAAGCATT 
CTTATATACT 
CCTAGATGTT 
CTCCCAGAAC 
TTTTCCAGGA 
ATTATTCAGG 
CCCTGCCTTG 
ATTGCTTATG 
AAGTAATACT 
CTGTAATAGC 
AATTATCTAC 
AAAAGCAGTT 
ATCGTAAAAT 
TTGAAAAAGG 
AAAAGC CAAG 



ACTTAGAAAC 
ATTGACTACT 
CATCTTTGTA 
GAAATACTAA 
TTGCTATTCA 
AG CACTC ATA 
AAAAAAAATT 
TTTCTTGTTA 
TCAATCATTA 
AT ATTT AT C A 
GCATGCAACT 
GAGAAAGTGT 
CCATAAACCC 
ATGAGGTCAT 
GATTCTGCAC 
AGAGAGGGAG 
TAGCACTCTA 
CTGAGACTTC 
GGTATTTTTT 
AAGAAAACTA 
ACAAGTGAGG 
CACATGTAGA 
AGACCAAGAT 
TTACCCATGT 
TGTTTACCAC 
GTTTGTATAT 
GAGAGAAAAA 
ATATGATATA 
TTGAGCTGTT 
CTCAGTCTCA 
ATATCAATTA 
AAGGGATTTT 
GCTGTCTAGG 
TGTATCTTTC 
CAGGAAAAAA 
TTTTCAGAAA 
TTACAACGCA 
ACTGACTCAT 
GTTCATAGAA 
GAATAAAAAA 
TATAGCTGAA 
TCTCCTATGG 
TTCCTAATTA 
CTACAGAAGA 
CTCATCTGAA 
GGGTCAATTA 
TTTGG ATTT C 
TAAAAAAAAA 
ATCGTACTTA 
ATACCCTTTC 
CAAATAAAAT 
AATGTCTGTA 
AAGAATCATA 
AAGTGAGTAT 



TTTTTTATGG 
TCAATTGATT 
CATAAATCTT 
ATCAAAAGTG 
GGATCATACC 
TTTACAATAA 
GAAATTACAT 
GCCATGACCC 
CATGGGAAGG 
GGTGCCTTTT 
GAAACTACTT 
TTACTATGGA 
CAATATGACT 
TAGGATGGGT 
TTGGTCTTCC 
CTCTGCACAT 
CAAGC CAGC A 
CAGCCTCCAG 
TATGGCAGCC 
AGACTAGGAG 
TGAGAAGTTG 
TAGCCACCTC 
GTCTTACCAG 
AAAACATTGA 
TTAACTGACA 
GTGGTGAAAC 
TAAATAAACT 
CCTGGAAATT 
CCCATGCACC 
TGAAGTCGAC 
TGGATACATT 
TTTTCTTGAT 
CTGCCCAAGT 
AACCTCATGG 
GGGTAAAGGG 
ATTTAAGAAG 
ACAGAAACGT 
TAACCTCCAC 
ATGTTTTGGA 
CAACATGATG 
AAGAGAGGTT 
GTGATTTTCA 
AATGGTGGCC 
TGTGCAGTTT 
CAGAGAGGAC 
TTGTCTTGGA 
ATCTCCCAAA 
TTTTGTGTGT 
CACTTGTTAG 
TACAAAACAG 
ACTTGAAAAT 
AAAATTATTG 
TGAAAAGGGA 
GGTAAGAGTG 



ATG TATAATT 
CCCATTTTTA 
TGTTCAAATA 
AAAAACATTT 
AATTTATAAT 
ATTT TAAAAA 
TATTTTAATG 
TATAAGAAAT 
CACTATATAA 
CTGCGGAGGA 
TTACCTACAT 
CTGAATTGTC 
CTATTCCTAG 
TCCTAACTGG 
AAATTAAATA 
ATACTGAGGA 
AGAGAGCCCT 
AAC TGTG AT A 
C AAG CCAACA 
AGAGAAAAGT 
ACCTTGTTCT 
ACAGTCAACA 
GAGACAAATG 
ATCTCATGAG 
GTGATAAAAA 
AGGTGCTCAC 
GGAAGGAGAT 
CGATTGACCA 
AGGCACTGTA 
ATGCTCATGG 
GGGCCACATT 
TGG CAAG AAG 
ATGCAGGTCT 
TCATCTGCAG 
AAAGGGAAGT 
AAAGACTTTC 
C AC CTTAAAT 
GTGGCTTGGA 
CATCAAGTCT 
TGGGTAAATT 
GAAATGTCAG 
GCTATGCTGA 
CTGATCTTAG 
ATAAATGAGT 
ACCTTCTCTG 
CATTGATTTA 
ATAGATGGTA 
GTGTGTGTGT 
ATTTTTAGAG 
ACAAATTAAA 
GAAGAAATCA 
CCAATCAAAT 
CTACTCATTT 
CTGTCAAGTG 
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22681 AAACCCTGCT AATCTCACTG AACATGTAAA AATCTGTAGA TGCCTTTATT TTATTCACTC 

22741 ACACACATAT GTAGAAAGAG AAATATATGG TAAACATTAA AAAAACCAAA TTAGAATGTA 

22801 AAATTAATAC TTTAAAAAAT GGGCTGTATA CTTTTCTTAT CACCGGAGAT AAGAATTTAT 

22861 TATTTTTAAA ATAAAGTTAT TTTCTCTGTG ACTGTTTCCA TGACTTTGCT ACTTAGAAGT 

22921 TAGAGATGCC AAAGTTTATC TAAGAAAATG TTTATGGAAA TATTATTTCA ATAATGAATG 

22981 TTTAGAAGAC TGAATTTCCT GACTGGGCGC AGTGGCTCAT GCCTGTAATC CCAGCACTTT 

23041 GAGAGGCTGA AGAAGGAGGA TCGCTTGAGT CCGGGAGTTC AAGAGCATCC TGGGCAACAC 

2 3101 AGCGAGACCC TGCAGCAAAG TAAAAAGAAA AAAGAATTGA AAAAGGAAGA CTGAATTTCC 

23161 TTTGGGCAAG TCATGTGACA TTCCTGTGCC TCAGTTTCTT CATCTATAAA GTTAATTCCT 

23221 ACATTTTTGG GGAAGGGAGA GAAAAACTTA GGATAGTGAC TGGCACAGAA GAAGCACTAT 

23281 ATACTATATA TATGTGGATA TCATTTGTTT TTATGGTACC ATTTTAGCTA TCTAATGCAA 

23341 AATATGAATC TTTTTTTTCT GGGTCTTAAA TTATGGAATG TAAGAATTTT CTAAATTCTC 

23401 TAATTCTGTG TTAGTTTTAA AGCAATGGAG TAACGTATCT GTCAACTTGT AAATATAAGG 

23461 ATCAACCTGA TCCACAATTT GACCCCTAGC CACTAATATT TAATAGTACA ACACTCAGAA 

23521 ATTATCAAAG GTCAGAGAAG CCAAACAAAT GTAAAAACAT ACAGGTGCTC AGAAAGATGC 

23581 AC CTGTAATC TCTCTAAGGA GAAATATTTT CCAAACTGAG TGACACGGTG CTTTAGTGAG 

23641 TTGTGGAATC AATCTCATGA TTTCCAACCT AGTGTTCTTT TAAAAATGAA CTAGTCCACA 

23701 GTAGAATATA CTAAAGTGCT GGTGCTTAAG ATAGTATTGT TTTCTGGAAA AAAAAAAAAA 

23761 ATTTTTTTTT TTTGAGACAG GGTCTCGCTC TTGCCCAGGC TGAAGTGCAG TGGCACAATC 

23821 ATGCTCACTG CAGCCTTGAC CTCCTGGGCC CAAGTGATTC TCCCACCTCA GCCTTTTGAG 

23881 TAACTGGGAC CACAGGTACG TGCCACCACA CCCGGGTAAT TTTTTAATTG TAGAGACAGG 

2 3941 GTCTTGCTAT GTGCTTAGGC TGGCCTTGTG AACTCCTGGG CTCTAGTGAT CCACTAGCCT 

24001 CAGCCTCCCA AATTTATGGG ATTATAGGCA TGAGCCACCC TACCTGGCCT GTTCCCTGAA 

24 061 TTTTTTTTTC TTTCAGGTGT TTGTGCATAT GTGTGTGTGT ATGGGTATAA CAGAGAGACA 

24121 GAGAGAAAGA AACTTTTCTA TCTCACTTTG CAATCAGAAG TTTGAAGTCT TATCTTTTGG 

24181 CTTTTGTTTC AGAAATATTT CAAATGTAGA CTCTCTCCTT TACCACACTG TCCCCTTAGG 

24241 CAAGGTCTTT GCCATTCTTC TGAGACTATT GCAACAGACT CCCAACTTCT GACTGTGGGC 

24301 CCTTCTCAAA AATGATTGTT TATGCAATAA ATCTAAACCC AAGACAACTA CAACAATACA 

24361 ACAAATTCTC TGCTTAAAAA CTTCCAATGT CTGCCGGGCG CGGCGGCTCA CGCATGTATT 

2 4421 CCCAGCACTT TGGAGGCAGA GGCGGGCAGA TCACTTGAGG TGGGGAGTTC G AG ACTAGC C 

24481 TGGCCAACAT GATGAAACCC CATCTCTACT AAAAATACAA AAAATTAGCC AGGCATGGTG 

24 541 GTGGGCGCCT ATAATCCCAG CTAATTGGGA GGCTGAGGCA GGAGAATTGC CTGAACCTGG 

24601 GAGGTGGAGG TTGCACTGAG CCAAGATCAC ACCATTGCAC TCCAGCCTGG GCAACAAGAG 

24661 CAAAACTCTG TCTCAAACCA AACCAAAACA AAACTTCTAA TATCTACCAA ATGTTTCACA 

24721 CAAGTATTTG GGGATCTTCA CAAATGGCCC TTATGGAGTT TTCCTTTGCT GAGACCCTAT 

24781 GCTCTGGCCA CACTAAACTC ATTCAGCATC CCAGAAAGGC CTCAGCCTTT GTGAGCAAGC 

24 841 TCTTATCTCC AGGCCTCTCA CAAAGACCTG TTCCAGTAGA AGCTCAGGGG AGCACACTGG 

24 901 ACATTATTCC AACAACCCTT TCCCCACAGC TATGCAGCCA AATCTGCCAG CTCAGTTAAT 

24 961 TAATTAAGCA ATTCAGAGAT GAGGGTCTGC CCAGGCTGGA GTGCAGTAGC TGCGACCTCA 

25021 AGCTCCTGGG CTCTAAGTGA TCCTCTTCAG TCTACCCAGA AGCTGGGACT GCAGGCATGT 

2 5081 GCCACCACAC CCAGCTAATT TTTTTTTTTT TCAGTAGGGA CCAGGCCAAC CTAGTCTTGA 

2 5141 ACTCCTGGCC TCCAGCCTTC CGAAGTGCTG TAATTACAGG CATGAATCAC TGCGCCCAGC 

2 5201 CAACCCGCCC AGTCTTGTTA GACATGGGGT CTGTAGTTTC TAGTAGGTTC TTGAGTCTAG 

25261 GGTTCCTACC TCATGTTTTA TAGTTAATTT AGGGGAGGGA CTGTGTCTGT TTATCTGGGG 

2 5321 ATGTAGGGGT GGGCAGGGGG ATAGAGGGGA CTTCAATTAA TGAAACCAGA AG C AAAACTC 

2 5381 AGTTGAGGAC ACCGGTCATG AGAGTGGCCT GATTATGGCC AATCTTACAT AATGTGTGAG 

25441 ATCTTGATAT TACCCCATCC TTGAGAGTCC TCTATAAAGC TACAGGGACT TGGGAGCACC 

25501 TTTAATTACA GACAACCCAT GTTCCTGTGG ATTATGATTT ATTAGATTGC ACATGCCTAA 

2 5561 ATAAAGACAT CCTCTGCAGT CTTTTGACAA TTCTATAAGC ATCTTCTGAC TCCGCAATTA 

25621 GACAGCTAAG AGATCTGTGT TACTTCCCTC ACATATATAA ATAATTTTAA ATAAAAATCA 

25681 TGGCGTGAAT AATTTCTTTC CTCTACCGAT TTGAAGCTAT CCATTTGGAA GACCACTCTG 

25741 AAGAGATGAA ATAAGTCTTC TGCCAAAGAT TACTTATTAA TTTACAAGGA AAAGGGGAAG 

25801 TTTTGTTCCT CTCCGTGAAT TTGATTGAAA ATCGAGGGCT TTCTCGAATA GTTTTGGCAT 

25861 CCAGGGTCAT TTTTCATTAA AAAGAGAAAA GTCATGTCAA ATATGAATTT CCGCAGATTA 
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lilt 1 GCCAGACCT C GTGATCCACC CACCTTGGCC TACCAAACTG CTGGGAATAC AGGCGTGAGC 
29221 CACCGCGCCC GGACTTAGAC CACTTTGTTT TGGCCAATAG GACAACAGCC ATAGAACCCT 
2 9281 CCGCAAATGA GAGCTTGTCC CTAAAGATGC TTTATTTACA TAGCTGTGTG CCGCATGAGC 
CAAAAGGTGA TAACCTTTGT TCAACACGCG CCTCCAGCCC TTCGGTTAAG TCCAAAGTAC 
CATTCTTAGA ATGCTCTAAA ATACATAATT TTTTTTTTTT TTTTTTTTTT TTTTTTTGAG 
2 9461 GAGTCTCTCT CTGTCTCCCA GGCTGGAGGG GAGTGGCGCG ATCTCGGCTC ACTGCAATCT 
2 9521 CTGCTTCCGG GCTAGCTGGG CCTACAGGTG CAGACCACCA CGCCCGGCTA AGTTTTGTAT 
29581 TTTTTTTGGT AGAGGGGGTT TCACCATTTT GGCCAGGCTG GTCTCGGATT CTTGATCTCA 
29641 AGTGATACAC TAGCTTTGGC CTCCCAAAGT GCTGGGATTA CAGTCGTGAG CCACTGCGCC 
29701 CAGCAAAATG CTTTTTGTGG AGCCAATCAC TTTATTAGCG CTTACCTCTC TATGCCTACT 

2 9761 TTATGCTTTG AAATTTTGTC ACAGTGTGGC CGGTCATGGC AAACACAATT CATTCTTATG 
CAGGATGTCA CGGTTATTTC TGTCATCCAA ACTCATTCTC GCAACGCATT TC AG CTCTTT 
AAACGACTTT GTGAGCGGCC CTGAAAAGGG CCTTTGGGTT TTTTTGTTTT TGTTTTTTGA 
AGTTCTCAGG AGACCGCGTA TTCTTAGATT CAGCCGCCGA AGCCATACAG AGTGCGCCCC 
TGACGTTTTA GGGCATATAC TACATCCATG GCTGTGACAG TTTTGCGCTT GGCGTGCTCC 

30061 GTATAGGTGA CGGCGTCTCG AATAACGTTC TCTAAGAAAA CCTTAAGCAC ACCTCGAGTC 

3 0121 TCCTCATAGA TAAGACCGGA AATGCGCTTG ACGCCACCGC GCCGAGCCAA ACGGCGAATA 
GCCGGTTTTG TAATGCCCTG GATGTTATCC CGGAGCACCT TACGATGGCG CTTAGCACCA 
CCCTTCCCCA AGCCTTTTCC GCCTTTGCCG CGACCAGACA TGATTCCTAT CGCAGTGGAA 

3 0301 GGTATGAACT GAAACAGTTC CTTAAATACA AACTTGGCGG AC CTG ATTG A AAACAACATG 
3 0361 AGTTGGCGCG GTTTTTTTTT TTTTTCAAAT TTGGTCACCA AGTGGGTGGA GCAAGAAAAA 
3 0421 C TGTTTC ATT ATGGTTCATT GTTTTGATTG GCCAGTGACA GCTTGCTCTT TGTGGGAGTG 
3 0481 GAAGGGTGTT TGCAAGTTGA ATGCGCTGTA TTCCTGTCAG CTTAATGACG CTAAGCATAG 
3 0541 CCCCATTCCA CATTTCTTTT TATTTCCACT TGCTAACTAA TAAATTACGG AATAGTTTAT 
3 0601 TGGGGAACAT ACAAATAATG TTTAAAGGAG GTCAGATTTA TAGGTCAAGG GATTTACCCT 
3 0661 CCCAATCATT TTAATATTTT TATTTAAACC AGGCATTTTG ATGGCCTTCT CTGTGCTGGA 
3 0721 CAAGGTATAA GTTTGGCTAT GAAGTTTCAC TCCTAAAGAC CCTATGTTTT GGGAAGGCAA 
3 0781 AAAGGTAGCC AAATAATTGC AAATTAAAAC CTCATAAGTG CAAACTTCTT CCTCGTCACT 
3 0841 TTCCCTATCT CGATTCAAAT ATTTGTTGAA TGACTCATTT TTCTGCAAAA GTCTGAGAGA 
3 0901 GACAGGGAAT ATAAACTTAA GTCTGGATAA TATGTTTTCC CGGGACGCTC TTCCTGGTCT 
3 0961 GCTGTGCCTG TTTGCTGTGC CTGAAATTCC AAACACTCTT CCCTTCCCTC CGTTTTTAAT 
31021 CCCCTTTCAA CTTGCTACAG CTTTAGAGAA AAGAACATTC GTTTTGTACA GTTGGGGATT 
31081 AATTGAAGTG TAGGGCTAAT ACTTGATTAA GGTCATTACA AAATCTACAG GGTCTTCCTC 
31141 TGGGAGGTTT TTGTGATAAG ATTATTGGTG TTAAAATAAG GCTAATCCCC TTGAAAAATA 
31201 AATAGAATAG CAGAATTGGG TCTGAATGTG GTTTGAAGAA AGGGACTTCT CAATTCAAAA 
31261 TTTTATTCTT AGCTTCCTGC GGGAGCTTTC CAGAATGCCC ATAAGATCCA CTTTTGTTTA 
31321 AAAAACAAAA ACAACCCCAC CCACCACTCT CTGGTTAATA AATGAATTTC TATTGGGAAT 
31381 ATTTAGAATG GGGCTGTGGC CTGTGAGAGA CATTATATAG TAACCTCAGA CTTGCTCACA 
=144 1 TGAAGAGAAG AAATCCAGGA ATGGAGAAAA AAGACCCAGG AAAGGCCAGA ATGCTCTACA 
TGTCATATTG TTTGTATCAC TTCTGAAATA ATTGATTACA TTCTTCTGCC CCAAATTGAG 
TTCTTAGGTT CTTCCACTCA CTGTCCACAT GCCACAACAC AGACCTTATA ACTAGAGACT 
TAGCTAGGAA GAAATGTCAA ACATTACAGA GAAAAAATGC AGAGTCTGAG ATCATAAGTA 
AAACTCTGAA ATCTCAACAT GCCTTTTAAT TCATGAAAAT AAAAAATATA GCAGCATATG 
CAATATGACA ATTCTCTGAA AACATACATC ATGTGAACTA CCCTGGAACA CATCTCGCCA 
AGTGCCATCT TCATTTTAAC CAGAGGTCTA GGATGCCTTT CCTTTATTTT GCCTATTATA 
TCATTTATAA AACCCCATTT TTATTTTGAT ATTTTATTTA CTTTCTATTT CCTGCTCCTA 
ATATCTCCTT TCTAAACTTT TCTCAATGAC AGTGACTCAA AAACAATGAA TGTCAGAACA 
AATATTTAAA GGATCTGTAC ATGTAGATAT ATATATTTAA AATGGATTCT TCCACTCTGC 
GAAGAATTCA GGCATACTCA ATCTTATGGT TAGGGAGAGA TTAGGCTCAC TCGCCTAATC 
32101 TGTATGGCTT CTCGTTCGCT TTCCATTTCA CCTTCCTCTC ACCCATCAGA TCAAACTCAT 
32161 TCATTGAACA AGAGACCTAA GCCCTTCAGA TTAAAACTCT GCAAACAAGT TGTGGTTGAG 
32221 AGGATACATG AAGCATTCAA ACAAATAAAT CTATGATATT AATCAGAGGT TAATCTATGA 
32281 TATTAATCAG AGGTTAATGC AGTGGCTCAC GGCTGTAATC CCAGCACTTC AGGAGGCTGA 
GTTGGGAGAA TCGCTTGAGC TCAGGAGTTC AAGACCATTT TGGGCAACAT AGCAAGTCTT 

Figure 8 (Page 10 of 73) 



31501 
31561 
31621 
31681 
31741 
31801 
31861 
31921 
31981 
32041 



32341 



SUBSTITUTF SHPPT /RULE 9R\ 



WO 98/14466 



PCT/US97/17658 



26/162 

32401 CATCTCTACT TAAAAAAAAA TAACCAGAGG TGTTATGAAA ATATAAATTG TCCAGAACTA 

32461 CCCTCCACAA ACTAACTCTC TCAGAATATT CGATATGAGG AATGAAATAT GGTGTGTGTG 

32521 TGTGTGTGTG TGTGTGTATG TGTGTGTGTG TGTGTGTGTA TGCACCTATA TATGGCACCT 

32581 ATATATTCAA CAAACAATTC TGATAATTGG CCAGGGTTGA GAATGACTAG CAGCCCAGCA 

32641 TACACTATCA GTTTTAAGTA TATAATTGCG CTTTAGTAAA ATGTAAAGAA ATCCCAGAGT 

32701 AGAAATACTT TTAAGCTATA TTACAGGTGA GAAAATGCAT AAGTATAGTC TCACCCAACT 

32761 TAGACTATGG GGGCTTTATA ATGTCACAAC AGTTGTTTCC AGGCATTTGG GGACATCACC 

32821 ACTGGTCTTG GGCAAGAAAC TCCTCTAGCC AATGGCTGAT TTATCTCACT CCCATCTAAG 

32881 GCTTCACTGC ATTTCTCTTT TTCAGCAACC TAACTTATTT AAAAATATCC ATTTTCTGAT 

3 2941 TCATTTTTTT CTGAATTAAA CTGTCAGTAC CATTGGCACA CCTTTGGTTC CGTAGCATAC 

33001 CTGTGTCTCT GCTGTGTTTT TTTTTTACCT CCACTCCTTA CTTTTCTAGA AAAAAATCTC 

3 3061 TGCTTTTTCT TTTCAGTTTA AATTATTTCA CAAAAAGTTT TCTTGACTTG CACTTCCTAG 

3 3121 GCTTGCTGTC CTTGTGTGGG CACGCTCCCA TAAACACTAT TAATACACTT CGATTTGTTA 

3 3181 AAAATAAAGA TATCTGGACA GAAAATTTCT TTTCTTTTTT TAAGATTTTA AAATTTTTAA 

3 3241 TGTTTATTTT TTTCCTAGAC TGGAGTACAG TGGCACCATG ATGGCTCATG GTAGCCTACA 

333 01 CTTCCCCGGG CTCAAGTGAT CCTCCCACCT CAGCCTCCCA AGTAGCTGGG ACTACAGGTG 

3 3361 TGCACAACCA CACCTGACTA ATTTTGTTTA TTTGTTTGTT TTGTTTTTTG AGATGGAGTT 

3 3421 TCGCTCTTGT TGCCCAGGCT GGAGTGCAAT GGCGGGATCT CGGCTCACCG CAACCTCTAC 

3 3481 CTCCCAGGTT CAAGCAATTC TCCTGCCTCA GCCTCCCGAG TAGCTGGGAT TACAGGCATG 

3 3 541 CATCACCACG CCCAGCTAAT TTTGTATTTT TAGTAGAGAC GGGGTTTCTC CATGTTGAGG 

3 3601 CTGGTCTGGA ACTCCTGACC TCAGGTGATC TGCCCGCCTC GGCCTCCCAA AGTGCTGGGA 

3 3661 TTACAGGCGT GAGCCACCAC GCTCGGCCAC TAATTTTGTA TATTTTGTAG AGATGGGCTT 

33721 TCCCTGTGTT GTCCAGGCTG GTCTTGAATT CCTGGGCTTA AGTGATCTGC CCACCTTGTC 

3 3781 CTCCCAAAAT GCTAGGATTA CTGGCGTGAG CCACCAGGTC TGGCTGGAAA GATAATTTCT 

33841 AACATTATCC TCTCTTAAAC ATTTGTTTCA AAAATTTTAC AAACATGAGA GTAATTAAAT 

3 3 901 TTGATTTTCA AAATTCCCTT GAATACTTTC TTAATAGCAC ACAGAAAGCA CAAAGTATTT 

3 3961 TACATTTGTT TTAATGATGA AATTGTGAAC CCAAACTTAC ACAAAGAAAA ACCCGTAACA 

34021 TTATACCCAT ACTTAAAACA GATGCCCTCA TATACATAGT AAAACTCTTG GGGGCAGTAG 

34081 TGAAGTTGGT TATTTACTGT TTTATGAAAG TGCCATTCAG CCGGGTGCAG TGGCTCATGA 

34141 CTGTAATCCC AGCACTTTGG GAGGTCGAGG CAGGCTGATC ACGAGGTCAG GAGTTCAAGA 

34201 CCAGCCTGAC CAAAATGATG AAACCCTGTC TCTACTAAAA ATACAAACAT TAGCTGGGCG 

34261 TGGTGGTGTG TGCCTGTAGT CCCAGCTACT CAGGAGGCTG GGGCAGGAGA ATCGCTTGAA 

34321 CCTGGGAGGC GGAGATTG C A GTGAGCCGAG ATCGCACCAC CGCACTCCAG CCTGGGAGAC 

34381 AGGGCGAGCT CCGTCTCGAA AAAAAAAAAC AAAAAAGTGC CGTCATAGTG ACTCAGTTTT 

34441 AAGGAATAAA TCAAGGATAT TTAACTCAAT AGACTACAGT TAGCTAACGT GACTTGCACT 

34501 GAAAGTTATA CGAATATTGG TACTTATTCC CCTGCCCCTG AAGTATGAAT TAAAGACTCC 

34 561 AAAATTCTTT TTAGAATCTT CAGAGTAAAA GCTAGAATTT GATTTTTTTA AATAATAAAA 

34621 AAATACTTTG TATCTAAATC TGGTGTATAA AATAACTTGG TGGATGATGC -TCAAGGCTA 

34681 TCCATCCCCA AATTTCTCCC TGAATGATAA AGAGAATAAA TGAATATGTC AATTCAAAAG 

34741 TTAGAAATTT GGCCGGGCAC GGTGGCTCAC TCCTGATAAT C CTTTCGG AC GCTGAGGTGG 

34801 GTGGATCGCA TGAGCTCCGG AGTTCAAGAC CAACCTGGGC AACATAGCCA GAACCCGTTT 

34861 CAATAAATAA TAGAAAAAAA TGAGCCAGGC GTGGTGGTCC CAGCTACTCA GTAGGCTGAG 

34921 GTGGGAGGAT CACTTGAGCT CAGGAGGTCG AGACTGCAGT GAGCCGTGAT CGCAGTACTG 

34981 CACACCAGCC TTGGTGTCAG ACTGAGACCC TGTCTCAACA ACAACAAAAC AAGTTAGAAA 

35041 TTTGGCTGGG CGCGGTAGCT CACGCCTGTA ATCCCAGCAC TTTGGGAGGC CAAAAAGGGC 

35101 GGATCATTTG AGGTCAGGAG TTCGAGACCA GCCTGGCCAA CATGGTGAAA CTCCATCTCT 

3 5161 ACTAAAAATA CAAAAAAAAT TAGCCGTGCA TGGTGGCATG CGCCTGTAGT CTCAGCCACT 

35221 TGGGAGGCTG AGGCAGGAAA ATTGCTTGAA CCCAGGAGGC AGAGGTTGCA GTGAGCCGAG 

3 5281 ATCATGCCAC TGCATTCCAG CCTGGGTGAT AGAGTGAGAC TCCATCTCGA GAAAAAAAAA 

35341 AAAATTCTGT ATGAACTGAA CAAAATATCC TTAAATTTTA AAATACATCT GAAAGATATT 

35401 TCAAAATATT TAGGAAAAAA ATTATAGGGA TCAGGCAAAT TCTGAGATTC CTTTTTCCCT 

35461 GCAGCAAACA TTAGGAGTGC TGCTGTTCCT AAAAACATGG TAACTGTTGC CACACCGTAT 

35521 GTTTCCTTGG CTCAGACATA AGGTTGTGTA GTTGTTATTC CAGAATAGCT AGAATAAAAA 

3 5581 TCCAGCACAT CATTTTCTTC AGCAAGTTAA CTAACCTCTC TGTGCCTTGG TTTCATAACA 
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35641 GCAACATAAG CATAACAGAA TAGCAGCAAT AGCTCCTACC TACCTCATAA GATTCTTTGG 

35701 AGGAATTAAA TTAAGATTCA GAACACAGCC TAATATCTAG TAAGTAATAA TAATTGGCTA 

35761 AAAAAATTTT CTTAAGATTA TATATATTCA TGGGGTACAA GTACAATTTT GCTACATTAA 

3 5821 TATATTGCAT TGTGGTGAAA TCAGGGCCTT CAATCCATCC CGGAAAAAAA AAGTTTTTGA 

3 5881 AAAGATTTCT GCCATGGAAA ACTTTTAATG TACAAATTCA TCCATCCAAG AAATAGAAAA 

35941 TATATAAGTA TCAACTCCAA ATCCACCATA TCTATCTCTT CTACACCTTA AACAATTACT 

36001 CAGAAATAGA ATGCTTGAGA TACCAGAATG CATGCATATC AAGTAATAAA TGCATGCAGG 

36061 ATGTCAACGC ATCCTAGGCT TTCAAATAAA ATTGTCATAC AAAATACTTT AATATTGTAG 

36121 TAACATTCTA CATGTTAGAG TGTAGAAGTT AATCGCTGAT GCAAAAAAGG AAAAGAACAC 

36181 ATTATACCCA AAGCCTACAG AGAGAATCAC AATTACAAAT ATCAGCCTGC ATGTGAAAAT 

36241 CTTTAATTTG AAAGTCAGAA ATATTTAAAT GATAGTCATT GTTAAATCAG ATTGTGGTTT 

1 GAAAAAAAGT TAGTTTAAAA CTGAGTTTAT GAAAAATTTG GGGATTTTAG AGACAGTGTT 
TTGTTTTTAA ATGTGTGTGA GTTTGTGAAG AATGTTTTAT AAAATACTGA CAGTATTATA 
AGATGACATT ATTATAATAC AACATAAGAA TTTTGGCCTG TACCTCTCAG CAGTCCTCAA 
TCACCTGCTG TACTTGACTC AATGATTATC AGAGTGGTTT GTTTTCCTTC TGTTGTGTTC 

3 6541 CCAGTTCAGG CAGCTCAGCA ATGGCCTGTG ATTCCAGCAA TTCAAATAGC TGGTAAGTAG 

36601 TTTCTTGTTT GTTTTCTCAA ATTTTCAGGG GCTTTTCTCT ACAAGTGATT TCCAGTGCAC 

36661 GCCCCTCCAC CCATTCTTTA TTCCTTTACC TTCAGGAAAA CCCTCAGCGC TGCATCTCTG 

36 721 GTCACCGGAC CACCGTGGTA CATTTACCTA TGGCCACCAG GTGTCACCCT TCTCTTTACT 

36781 ACCATGGTTT GTGAATGGTT TTGCCAGAGG TGAATAAGAA TTTAAAATGC AGGTCTTTGA 

36841 TTTTTCAAAT GTAGTTGACC TTAAGAATTT ATGAATAAAG CCAGAAAAAT TAAGCTTAAA 

36 901 AAACACCGAA AGAAAATGAG GACTTAAAAT TTCTATTAAA AAAATTAACA GGCCACAGTT 

36 961 GCTGATGTTT AGTAAATGTG TTAGTGAAAT GTGTTACTGT GAAGACTGGG GTGTTTCTTG 

3 7 021 AAATCTCAGC CCAGGTGAAA TAAAACCAAT ATAAAACAAA TGCTTACCTA ATAAATTAAT 

3 7081 TG TAACAT AT TCCTTATGAG GTAGAAGAGT AAGTGAAGCC TTATAGCAGT CTGCTTTCAG 

3 7141 TATAGTAAGA TATTAAGAGA GAAATAATTT GTCATATGCT TTCAGAATGG TTTGCTGGTA 

3 7201 AAATAACCAA TGTCTTACAA CTTAGACGAC AATGTCCCTA GAGTGAAGAA ACACGATTAA 

3 7261 TTCGGCTACC ACAGTTGAAT GAAAATATTC CGTAAGACAA AATGTAAAGA AATTAGAAGC 

3 7321 AAAATAAATG TCTCCAAAAT GACAAAGCGA TTAAGTATAT ACACAAGATG AACAAGAACT 

3 7381 TCAATAAAAT CATGCAGTAT ACAATACAAT ATACATTTAT TAAAGTATAT GCATTTTTAA 

3 7441 TGCAACAATA ATACTAACAG GTAATAGACA AGTTGTTAAT AGTTTTTCAC TGGCTAATTA 

3 7501 AATAACAGCT TTAATTGTAT TCATTTTATA GCTTTTCTAC AATGAGCGTA AATCACATTT 

3 7561 AC TTTTTTCT ACATAACTTT TCTAACCACA AAAAAAGAAA ATGGTTTAAA AGAAGAGATG 

3 7621 AGATATCTTT GCTAAAATTT AATGCCTAAA GAAGAAACTT CTGAGCTGTA TATGGTATCC 

3 7681 TGAAGCACCT GCCCTTCAAG ACAGAATGCT TGTACCACAT TTATGCAGCC AAGTGCATGT 

3 7741 AGTAACATAA AGTAAACACA TGCCATCTGG ATATATATAT TAAGACTCTT TTGACGGCTG 

3 7801 GGCAGGGTGG CTCACACCTG TAATCTCAGC ACTTTGGGAG GCCGAGGCAG GCGGATCACG 

3 7861 AGGTCAGGAG AGTTCGAGAC CAGCCTGGCC AACATGGTGA AACCCTGTCT CTACTAAAAA 

3 7921 TACAAAAATT AGCCGGGCAT GGTGGTGCAC GCCTGTAATC CCAGCTACTT GGGAGGCTGA 

3 7981 GACAGGAGAA TCGCTTGAAC CTGGGAGGCA GAGGTTACAG TGAGCCGAGA TCATGCCATT 

3 8041 GCACTCCAGC CTGGGCAATA GAGTCTCAAA AAAAAAAAAA AGACTCTTTT GAACATGGTG 

3 8101 AACTGATTTC CCAGAATCTA GCAATTCCTG AATGTCCTGG TTAGATTTTT TTTTTAATGT 

38161 GCACCGGAAC CCCAGTGGCT CCATGGAAGG ACCTGGGCAT CCTCTAAGCC ACTTGGTGGC 

3 8221 TTCCATTATA CCATCTCAAA ATGAGAGAGC TTACTCCACT TCATTGAGGG AAATACCACC 

3 8281 AGAGTTCTGA CTCCAGAGGC ACTGGCCTAG GGAGGACACC GTGTGTGAAG CCCAGCAGGG 

3 8341 CCACTAGCTG TCCCCACCAA TTACAGTCCT TGCGTAGGGT CCAAAGAAAT GAATGCCAAA 

38401 GAGAGCAACA GAGGAGCAAG GGAGTCACAT TCCAGGACCT TCCTTCAGGG ACTTTTAAAG 

3 8461 GAAACATGAC AG CTG AGGAT CAGTTGGTTG TTTTCTGCTG TTCCCCTTCA TGTGATTCAA 

3 8521 GCTCACTCAG AAGAAACACA ATGAGACAAG AGAAGAGCCA TCTCCTTCCT TCTCTATTTA 

3 8581 TTCTAGGCAT CTAAACTACT GAATGTAGTG GTGTCTGAGA TGTATCAAAC GGTCAGATTG 

3 8641 ACTGAGTTTG AAACCTGTTT CTATCACTGA CAAACTATGA GATACTCTAT ACTTCACTTT 

3 8701 CTTTTTTTTT TCATTTTTTT ATTTTTATTT TTATTTTTTT GAGATGGAGT CTCACTCTGT 

3 8761 CACCTAGGCT GGAGTGCAGT GGCGCAAACT CGGCTCACTG CAAGCTCTGC CTCCTGGGTT 

38821 CATGCCATTC TCCTGCCTCA GCCTTCCGAG TAGCTGGGAC TACAGGCGTC TGCCACCACG 
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CCCAGCTAAT TTTTTGTATT TTTATTAGAG ATGGGGTTTC ACCATGTTAG CCAGGATGGT 
CTCGATCTCC TGACCTCGTG ATCCACCCGC TTTGGCCTCC CAAAGTGCTG GGATTACAGG 
CGTGAGCCAC CGTGCCCGGC CTACTTCACT TTCTTCATTT AAAAAAGAAA TGGGGATAAT 
AGTACCTATC TCATAGAATT ATTGTAAGAA GTGCATGCAG TAATGCATGT AAGTAGGTGC 
TCAGAAGAGT CGGACACGAA GTAAGTGCTT TTATCATCCT TATCATAATT TTCATTATCA 
GAACAAGGAG AGACCAGGTA GAAAATTATT GTGATTCTTC AGGTCTGGAA TACTAGAGTA 
GCATCCCAAA TGAAGGCACC ATTAAACTTT GCAAATCTGT ATGACACCTT CATGCCAATT 
AGAAAAAACA CCTCTTCACA ACCCCTTTCA AG AT ATTTG C CTCCTACCTG CTAAAAACAC 
CCATCATACT ACCCACAGAT AGCCATGATG CTTTTTCTGG GACAGGTGCC TCTTCCATTC 
GTG CAGTGTA CAGCCTTCAT AGCTGTGCAA CTCACATCAC AATCAGATGG AAGAATCCCC 
AAGGCTTGGT GACAGATGAG TTACTGGGTA ACACAGAGAG AGG ATT C AAA GGAAAAGTTG 
AACGGGTCCA GAAAATGCAT AGATACATGT GTAAAAATCT GGTAAGGTTA TGACTAGCCA 
CGTCCCAGGG TTCAAAGCTT TTCTCAGATG TTAAAATGAA TCATGTAAGT CCCCCAAATT 
TAAGGAGTCC TCTTCCAAAA ATAGGAAATG AAATGACATA GGTGTATGTC TCTGAGGTGA 
CGGAGGAAAT GAAGGAAGCC TCTAGATGCA GCTTGAGGTT CATGAGAGAC AGTTCCAGGG 
GAGAGGTCAC AGCTAGGGAT CACCGGCATG CAGGAACTCA GAAACCTAAA TGGGGAAATC 
TTTTTGAGGA AATGAACAGA GAAGGCTAAA ATCAAGGAGT TCGTCAGGCA ATTTCTATGT 
TTAGGTTCAA CTCTCTCCTG AAACATGAAG AGCTCATAAA TGCACTCCCT CTTTGAGTCT 
?!" ^I TTTGT CTCCTTCCCA CAGTGAGTCT GCAGGCTGCG TGTCACTCAC GTTCAGCTAA 
4 0021 GACGTAGTGC CCCATGGCTC CTCCTGTGGA GACAAGAGAC CCAGGAAAGA GGCATCACAA 
4 0081 ACCTAGGCAC CATCTTGCCT CTTCTCTCTT CCTTATTTTC CTCATTCACC CATCTCAATT 
4 0141 TAGACCTGGG CACTATTGGA TTTCAAGAAC CATTATCTCT CATCTGGAAA TGCTTATTGG 
4 0201 CTTTCTAACT GGTCTCCTCA CCTCTCATCT AACTTCTTAA CAACACATTC ACCATATAAG 
GGAGATCGTG GTCCTCCTTT CTTAGGATCC TTCAATGACA CCCCAGTGAT CATAACCCAA 
TATCCCAAAA GACCCTTGGA CTCTGTATGA GCTGGCTTCT TTCTGATTCT CTTTTCCCTA 
CACCACAGAT GTTCAGGGGG T AG AAATG C A TAATTGGTGA GTGATAGCTA CGCAAACTCA 
GGGTTAAGGT ACAGTAATTA TTTCTAATCT CCCAGTATGC CTTATACTCT CCTACTTGGC 
ATGGTTGCTC CGTCTGTGTA GACCTCCCAT CATCTTCAAC CTCACCTAAT GGAATCCAGC 
TTCTCCTTCA AGATCCAGAA GGCTATCTTG ATCCCCAGCT GAATGTGATC ATTCTTTCCT 
TTGACACCCT AAG CATTTGC TTCCTGCCTG CTTTAGGACC TCATGGGGTC TTCTTTAACT 
ACATTTACTT GCTATCAATT TCATTCCCTA CCAGATTTGG GTTCTGAGAA TAGCCACAGT 
4 0741 GACTTCTCAA CCTCAAAGCC CCTGTACTAC CTTAAACAGC TCTTGCAAAA TAGTAGGTGC 
4 0801 TCTGAAGATG TTTGTTGAAT TAGAGACTTT CATTCTGGGG AGAACCATTA TTTTCTGTCT 
CCCAGGGAGC TGCTGGTGTC CCCAAAGAAT ATAAATGAGA AAAATGCTTC CCATGGATGC 
CAGATCCCCT CTGCCCCTCT TCCCACTGTG CCCTGGGGCA GAGGTACTAA GAGACTTCCC 
CCTTGTTCCT ACTCACTTGA ACCCTGCCTC TTCCTTAATA TTATGAACAA AATTCCAATG 
41041 AACAAGATGA CGACAAAAAC AGCAATTCCA CTGATGACTC CAATGACTAG GGTG CC AG AC 
41101 GGTGAGGGCT CTAAAACAGA AAAAGCAAGT TAAAGCCTTT GATTGCCACC CTCAGCCCAC 
CCCCTAACAA AGAGCAGATC CTCATCTCAC TGCCATAATT ACCTCCTCAG GCACTCCTCT 
CAACCCCCAA TAGATTTTCT CAGCTCCTGG CTCTCATCAG TCACATACCC CAGATCACAA 
TGAGGGGCTG ATCCAGGCCT GGGTGCTCCA CCTGGCACGT ATATCTCTGC TCTTCCCCAG 
GGGGTACAGC CAAGGTTATC CAGCCCTGGT AGGTCCCATC CCCATTGGGC AATACGTCTT 
TAGGTTCGAA CTCCTTGGCA TCCATTGGCT GCTTATCCTT CAGCCACTTC ATGGTGATGT 
TCTGGGGGTA GTAGTTCAAG GCCCGACACC GTAGAGTGGT CACTGAAGAG GTCACATGAT 
GTGTCACCTT CACCAAAGGA GGCACTTGAC AGGAAAGAGG AAGGATGAGG AGAGGGGATC 
TGTTTACCCT TGCCAGGAAG ACTGGAACTT TCACTTCCTT CTATAGGTTG GAGGAAGGAA 
ATACCCTTTT CAGAAAAAAA CAAGCTACAG GAGAGACACC ATTTTGTGTC CTAAGATTGG 
ACTCTAACAC AGTGTCACTT GGAGAGCAGT C AG ATCAG CT TGTTCTCCTC ACATGTAAAT 
ATACATATCT GTTACCCATG TTCTTTGTTC TGATAGATAA AATTGCCCTT TATGTGCATT 
GAAAATGATT GAATACAGAT GGTCAGTTTC ACCTGGGTCA ACCTAGGAGG CATTGTTATA 
41881 AGAAGCGGAC TTGTAAGATA GGTAGCTTCA GTGATTATTG CTATGTTCTA TGAAAGAAAC 
41941 TTTTAACCTA AAGGATTCTT CTACTCTGAT AAGTGGCCTC ACTTGATATT TTGTCCTGGT 
42 001 ATTCATATGA TAGCTGAGAT CTCTGAATTC TCTTTTTTTT TTTTTTTTTT TTTTTAAGAT 
GGAGTCTCAC TCTGCTGCCT AGGCTGGAGT GCAGTGGCGC GATCTTGGCT CAGTGCAACT 
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42121 TCCGCTTCCC AGGTTCAAGC GATGCTCCTG CCTCAGCCTT CCAATTAGCT GGGACTACAG 
42181 GTGCGCATGA CTGTGACCAG CTAATTTTTG TATTTTTTTA GAGACGGGTT TCACCATGTT 
4 2241 GGTCAGGCTG GTCTCAAACT CCTGACCTTG TGACCACCCG CCTCGGCCTC CCAAAGTGCT 
4 2301 GGGATTACAG GGGTGAGCCA CCGTGCCCGG CCTTGACATT TCTGAATTTT TAACAGGTAT 
42361 AAATATACAA AAGATTATTG GTTAAATAAA AAGCAAGGGC CATAGACACT TCCCTTTGAG 
4 2421 C C ATATG CAT G GAG AAAAG A AATTAAACCC ATGACTTGTG GCTGTCTCAT ACATCTCAAT 
424 81 TATAAGGTAG AGACTCTAGG ATTGAGAAAG TCCCTTCCCA GAATTTGGAG AGGCACACAG 
42 541 CCTCAGCCAC CTCTGAAACT CCAACCAGGG ATTCCGTGCC CTGCAACCTC CTCCACTCTG 
426 01 C C ACT AG AGT ATAGGGGCAG AAGTGTGTTT CCACCATACC TTGTTGGTCC AAAACACCTC 

4 2661 TCCCCAGCTC CAGCAACTGC TGCAGCTGTG CAGGGCAGTC CCTCTCCAGG TAGGCCCTGT 

42 721 TCTGCCTGGC CCGAATCTTG TGCCTTTCCC ACTCCAGCTT GGTGGGCCAG GCCCTGGGTT 

4 2 781 CTGCTGCTCT CCAATCCAGT GTGTCAGGGC AGAATTCAAG GTGGTCCTGC CCATCATACC 

42841 CGTACTTCCA GTAGCCCTCG GTACTGTTGT CTTCTTG CAT TTCACAGCCC AGGATG AC CT 

42 901 GCAGGGTGTG GGACTCTGGA AAAATCCCCA GCCTTGTTAA CTGCAACCAA AGGAATAGGT 
4 2 961 CCCTATTTCC ACCATCCCCA AGGACCAAAT GATCTCAGGA AGCAAATTCC TTCCCTCTTC 
4 3 021 CCTGCTCCCA CAAGACCTCA GACTTCCAGC TGTTTCCTTC AAGATGCATG AAAAGATGAA 
4 3 081 AAGCTCTGAC AACCTCAGGA AGGTGAGGCC CCCTCTCCAC ATACCCTTGC TGTGGTTGTG 
4 3141 ATTTTCCATA ATAGTCCAGA AGTCAACAGT GAACATGTGA TCCCACCCTT TCAGACTCTG 

432 01 ACTCAGCTGC AGCCACATCT GGCTTGAAAT TCTACTGGAA ACCCATGGAG TTCGGGGCTC 
4 3261 CACACGGCGA CTCTCATGAT CATAGAACAC GAACAGCTGG TCATCCACGT AGCCCAAAGC 

433 21 TTCAAACAAG GAAAGACCAA GGTCCTGCTC TGAGGCACCC ATGAAGAGGT AGTGCAGAGA 
4 33 81 GTGTGAACCT GGAGACAGAG CAACAGGCCT TAACCATGTG TAGTAGGAGG GG AG CAGGAT 
4 3441 GTTGAGGCTC CACACACCTG CATCAACTCA TACCATCAGC TGTGTCTGGT CCTCATTTTG 
43501 TGAAGGGTGA GTTGCAGTCC TGTCTTTCTT CCATATGACA GTCCTGGGTG CTCTTTCCTT 
435 61 GTGTGCTTTT CTCTGCCACA CGTGGCTGCC ACCCCCTCAC TGCCCCCAGA TCCTATTCCA 
4 3621 ATACTCATGA TTAGACAGAC TCCACTAAAG CTGGTGGATT CTAGAAAATG TTAAGGTGTG 
4 36 81 TCTAGCCATG GTAGTTGAAC TCAGGAGTTG GTGCTCAGGG CAAATTAGAC CCAAATCCTG 
4 3 741 AGGAATAATT CCTTCAGTTT TTTTTTTTTT TTTTTTTTTT TTTTTTGAGA CAGAGTCTCA 

43 8 01 CTCTATCACC CAGGCTGGAG TGCAGTGGCA CAATCTCAGC TCACTGCAAC CTGCACCTCC 
4 3861 TGGGTTCAAG GGATTCTCCT ACCTAAGCCT CCTGAAAACC TGGGACTATA GGCGTGCGCC 
4 3 921 ACCACACCAG GCTAATTTTT GTATTTTTAG TAGACATGGG GTTTCACCAT GTTGGCCAAG 
4 3981 CTTGTCTCAA ACTCCTGACC TCAAATGATC TACCTGCCTC AG C C AC C AAA GTGCTGGGAT 
4 4041 TACAGAAGTG AGCCACCGTG CCCAGCCTTG GTCCTGAATT CTTACACTGA ACTGCCTATG 
44101 TGGCCTCACC ACTTGGAAGC CTGACTGGAA TCTCAAACTT AACATGTCCA AATGCAGATC 
44161 CTTGATTTAC CCCAAACTGC TCTTTCCTCT GCCTTCACCA TCTCAGAAAT GGCATTGCCA 
44221 ATTACCCCAC TGCTCAGGCC AATAAAATTA AAATAAAGAA CAAAGTCAAC TTTAACTCTT 
442 81 CTCTTTTTCA GGGGGTCAGG GGAGACAGGG TCTTGCTCTG TCACCTAGGC TGAAGTACAG 
44341 TGGCACAGTC ATGGCTCACT GCAGCCTCAA CTTCCTGGGC TCAAGCAATA CCCTCCACCT 
444 01 CAGCCTCCCG AGTAGCTAGG ATCACAGGTG CATGCCACCA CACCCAGCTA ATTTTTGTAT 
444 61 TTTTTGTAGA GAAGGGGTTT TGCTGTGTTG CCCAGGCTGG TCTTGAACTC CTGAGCTCAG 

44 521 GAATCTGCTC TCCTTGGCCT CCTCCTTGGC ATGAGCTACT ACACCCAGCC AATTCTTCTC 
44 581 TTTCTCTCAC ACAACATAGA ATCCTTCAGC AACTTCCTTC AGAATATATT CAGGAGACAA 
44 641 TGGTTTGTCA CTCCCTTTTC TGTTCCCACC CAGCCCACTC CACTACCTCT TGCCTGGACT 
44 701 GTGTAACAGC TTCCTGGCTG GGCTCCCTGC TTTTACTGTT GCTCCCTTCA TTCTGCTTTC 
44 761 CACATAGCAG CCAGAGCAAT CTTTTAAAAG CCTGTGACAG ATCACTGTTA CTCCTTGGCT 
44 821 AGAATTCACA CCACAGCCTA CAGGCGCCTG CACAACCTTG TTTGTGGCTC CTCTTCTGAG 
44881 CCCATTACCT ACTTCTTGGC CTCTACTCCC CAGCACTACT TGTTTATTTT TTTCAACCCG 
44 941 AGCTTCTTAA CCAGGAGTTT GTCTACTAGG TGACATGTGG CAAAGTTTAG AGACATTTTT 
4 5001 GGTTGTCAAG ACTGGGGGAG TGCTCCTAGC ACCTAGTGAG TAGGGAGGAC AGGATACTGC 
4 5 061 TAGACATCCT AC ATG C AG AT GGTAGTCCCC CTTCCCACCC CCACGCCGCC CCCCCCCCCC 
4 5121 ACACACACAC ACATGAGTAG TGCTGAGAAA ACCCGCTTTT TAATCCAACT TGCCAGGCCC 
45181 ACTCAGTTTG CCTGGGAAAT ACTGCTCCCA GTCAATATCA TTCTTATTTC CTTCATGTCT 
4 5241 CTGCTCAAGT GTCAGCCCCA GAGTGACTTG CCCTGACTTC TCTGCTTCTC ACAACACCCA 
4 53 01 TGATTTCCTG ATGTTGTATA TCTTTCTGCT CATTTGCTTA TTGTCATCTC TCCCACTAGA 
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CAGCTACATT 
GAGCTAAGAT 
AAAAAAAAAT 
AGACAGGAGA 
TGCACTCCAG 
AAATGCAAAA 
TTACATTTCT 
CGCCTCCCAG 
CTACTTTCTT 
CGTGACTGGC 
TATCTATTAT 
TTGGTTTGTT 
GTTTTTTTGT 
GGGTCATATG 
GCATTTTATC 
GTTTTTGAAT 
CTTGTTGAGA 
ACCATAAATG 
TAATCCTATC 
TGGCCCAATC 
TCCCAAGCAG 
GTAGAGACGG 
CTGCCCACCT 
ATAATCCTAT 
GAATTCAAGA 
TCTCAGTCTG 
TTGTATATGT 
AAAATAGTGA 
AAACTTGTCA 
TATAGGTGCC 
TTTGGCCATT 
CAAATGGTCT 
ATCCAAGATT 
ATCCTGGCTC 
GCTGGTAAGC 
TATTTGTTGG 
GCATGAAGGG 
GAGGCTCTTG 
CAGGAATTCA 
AAATAAAATT 
GAAGATCACT 
CCAGTCTAGG 
GAAAGTGGTT 
GTGTTGTGGA 
C C ACTTGTT A 
ACTCTGTCCT 
TGGTTAAGAA 
ACAGACACAG 
ACAAACCAAA 
AGGCTACAGA 
GAGAGAATAA 
CTAAGGAACT 
TCTGTACCCT 
AAAGGTGTTT 



AGAGGCTGAG 
CGTG C CATC A 
TAGCTGGGTG 
ATCGCTTGAA 
CCTGGGCAAC 
ATTAATGGAT 
ATCTCCCCAA 
ATTCCTCCAT 
TCTATTTGGA 
TTCTTTCATT 
AAGGACATAC 
TCTACTTTAT 
AGACTTATGT 
GTAACACTGT 
CTCCTATCAG 
CAGGGCCCCA 
AGACTCTTTT 
TGAGAGTTTA 
TT^* T T T T 
CCGGCCACTG 
CTGGGATTAC 
GGTTTCACCA 
CAGCCTCCCA 
CTTTATGTCA 
AGTTTCTCAA 
CTGAATTTCC 
TTTAATATTT 
AAATCAGAAC 
TATAAACAAA 
AAAGGCTGCA 
AAGAGACTTA 
GAAGGTGAAT 
AAAAAGTTGA 
CACCTTCTGC 
TGGAAATGAC 
GGTTCAGATT 
AACTGGTATA 
CCTGTAATCC 
AGACCAGCCT 
AGTCAGGTGT 
TAAGCCTGGG 
TGACAGAATG 
GAAGATCTAC 
AAGAAATGGG 
ATCATCCTTT 
AACCCTCAGT 
GAGATTATAG 
AGGGATGATG 
CACAGGAAGC 
GGGATCTTGG 
ATTTCTTTTG 
TGATATACAT 
TCCTCCCAGT 
GAACTGGTAA 



GCAGGAGAAT 
CACTCCAGCA 
TGGTGGCATG 
CCTGGGAGGC 
AAGAGCGAAA 
TTTAGTATAT 
AAAGAAACCA 
TCTCCTCCTC 
ACATTTAGTA 
TAGCATAATG 
CACAACATAT 
GGCTATTGGG 
TTTGATTTCT 
TTAACCTTTT 
CAGTGTATGA 
GATAGAACAA 
TTCATTGAAG 
TTTCTGGAGT 
TTTGACAGAG 
GCTCCTCCTC 
AGGTACCTGC 
TGTTGGTCAG 
AAGTGCTGGG 
GGACTACACT 
CTTCAAATTT 
CTAGGAATTT 
TCATAAGAAA 
ACTGGGGGTC 
AAAGAAATGA 
GAGAAATGGT 
GAAGACTTAA 
AGATCATTTC 
CTGAACTGTT 
TGCAAGCAAA 
AAAAATTACT 
TTCATGTACA 
GGGCTGTGTT 
CAGCACTTTG 
GGGAAACATA 
GGTGGCACAC 
ACATTGAGGC 
AGACCCTGTC 
TTTTCTCTGT 
GTGAGAGCTA 
TCCACCCACT 
AGCTGTGAAC 
TGGAATAGGG 
GCCAGGTAGA 
TGCTAGAAGT 
CCCTGATAAT 
TTCTAAGCCA 
TTCTTTTACT 
GTCAACACAT 
TGAAAGAAAT 



CGCTTGAACC 
TGGGAGACAA 
CACCTGTAAT 
GGAGGTTGTG 
CTCCGTCTCA 
TTACAGAGAT 
TGTTCCCCTA 
CTCCCCTCCC 
TACATAGAGG 
TTTTTATGTA 
TTTATTTATT 
AATAGTG CTG 
TTTGGTTATA 
GAGGAATTGC 
GAGTTCTGAT 
AAATGTGGTT 
TGTTTTGGCA 
CTCAATTTTA 
CCTCACTCTA 
CCAGGTTCAA 
CACCATGCCT 
GCTGGTCTGG 
ATTACAGG C A 
GTCTTGATTA 
GATCTTTTTT 
TAGGATCTAT 
CTTTTTTCAT 
AGGCGCATTT 
CCAATCACAT 
GTCAGATATA 
GCCATAGATT 
ACCTTTAAGA 
AAGGAAGAAA 
CAGAAATGCT 
CCTGGGAAAG 
CTTGGGAAAG 
CATAAGGTCA 
GGAGGCCGAG 
GGGAGATGCT 
ACTTGTGGTC 
TGTAGTCAGC 
TCCAAAAAAA 
AAAC CTAAT A 
CGTAGATGCA 
TATGGGATGA 
CTGACCTTAT 
TGAGTCCTCC 
GATGGAGGCA 
GGAAACAGGC 
ACCTTGATCT 
CCCAGTTGAT 
GTCATAGAAG 
GGAATTCCTC 
CTCAGCATGA 



CGGGAGGCGT 
GAGCAAGACT 
TCCAGCTACT 
GTGAG CCGAG 
AAAATAAAAT 
GTGCAACCAT 
ATTCAGTACC 
AGCCCTAGAC 
CATATAATAT 
TGTTTTTCAT 
CATTCATCAG 
TTATAAACAT 
TATCTAGAAG 
CACATTCTTT 
TTCTCTCCAT 
ATTCAGTTGT 
CCCTTATCAA 
TCC CATTATG 
TTGCCCAGGT 
GCAATTCTCC 
GGTTAATTTT 
AACTCCTGAC 
TGAGCCACCA 
CTATAGCTTT 
TGGAAGACTA 
TATCAATGTC 
TTAAACTTTT 
AACAGGCAGA 
TGTGGAAGCC 
CCTGAAAATT 
GCTCAGTGAG 
GAGCAGGTAG 
CTCTAATCTT 
GAAATTCAAC 
TCAGATTTAG 
GGTTTAGCTT 
AGAGTTGAAG 
GCAGGAGGAT 
GTCTTCACAA 
CCAGCCACTC 
CATGATAGTG 
GAGCTGTATC 
AAGAATAGAG 
AAACAATACA 
ATTGCATCTC 
CTGGAATACG 
AAC CAATG AC 
GAGATTGGAG 
AAGAAAGAAT 
CAACTGGCCT 
AGTACTTTGT 
TTTTGAATCT 
TCCTTGTGCC 
GGCCAGATGC 



AGGTTGCAGT 
TCATCTCAAA 
CGGGAAGCTG 
ATCATGCCAT 
AAATAAAATA 
TACCAAAATT 
CTTAATTCAT 
AATCTTTAAT 
ATTGCTTTGC 
GGACCAATAA 
CCGATGGACA 
TTATGTACAA 
TGGGTTTGCT 
TCCAAAGTAA 
CTTTGCCTGG 
TCCACCATCA 
AAATCAATCT 
CTATAATCTA 
TGGAGTGCAG 
TGCCTCAGCC 
TGTATTTTTA 
CTCAGGTGAT 
CACCCAGACT 
TTAGTAAATT 
TATTAGCTAT 
TATTCTATTT 
TTTTTTAAGA 
AGAAGAATAA 
ATGGAGTGGT 
GTCCATTGTA 
ACCCCGAGGG 
GAAGCTATAA 
GAGCCACCCT 
ACTCACAAAG 
AATTAGGCCA 
ATAGGCACAT 
GCCAGGCATG 
GGCTTGAGCC 
AACAATTAAA 
AGGAGGTTGG 
CTACTGCACA 
CACATCCCAG 
TGACAAATGT 
TCCCCACATA 
CCCAAAAGAT 
GTGAGTTCAC 
TGGGGTCCTC 
TTATGCTGCC 
CCTTCCCCAG 
ACGTAACTGT 
TACGGCAGCC 
TTTAAGTAGG 
TTGAAAAGTG 
TGTACCTCAC 
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51841 
51901 



ACCTGTAATC TCAGCACTTC GGGAGGATGA GGCGGGCAGA TCACTTGAGG TCAGGAGTTC 

51961 ™ri£^™ TGGCCAACAT GGTGAAACCC CATCTCTACT AAAAACAAAA AATGTTATCC 

llotl rilnnn^ TGGTGCCTGT AGTCCCAGCT ACTCAGGAGG CTGAGGCAGG AGAATTGCTT 

llnll ^ CGGGA GGTGGAGGTT GCAGTGAACT GAGATCACGC CACTGCACTC TAG CCTTGGT 

52081 GAGAGAGCAA GACTTGGTCT TAAAAAAGAG AAAAGAAAAA TGAAATTTCA GCATTATAGA 

52201 rl^ T n GT TTCCCCTTCC CCCCAAACTT TAAAAAAGCA GAAGTCTGCA 

52201 GTCTTTGCCA ATGTTATTTT TATTATAACA AAGGAATCTT GCAAGGCTAC CAGATCTCAG 

\ r^Ztll^ TATGTTCTGT AAAAATCACT TCCTAAAATG TCTGAATTGA CTG CTTGTCT 

52321 CATTTATTTG TTTCTCGTGT CATACTGCAA TGGATATCTG TCTTGTTAGT ATAAATATTT 

52381 GTGCATTTTG TTGTTGTTAA AACAGCTTTT TTGGCCTGTC TTCTTCCACC TATgIgSaI 

52441 TATAAAACTC ATGTTTAACA CTTATTTTTG TAGCAGGACA AGCtIcSaC ££££££ 

till, ?2^ CTGAG TTAAAGAAGG AAGGGCTTTA TTCAGCTGGG AGCTTTGGCA A^SSS 

52561 CTCCAAAAAC CGAGCTCCCT GAGTGAGCAA TTCCTGTCCC TTTTAAGGGC TTGCAACTCT 

52621 AAGGGGGTCT GTGTGAGAGG GTCATGATCG ACTGAGCAAG TGGGGGTATG TGACTGGCAG 

52681 CTGCATGCAC CAGTAATCAG AACAGAACAG GGATTTTCAC AGTGTTTTTC CACACAATGT 

tlltl TAGATAACAT ^CGGTTAG GTCGGGGGTC AATCTTTAAC CAGACCCAGG 

528 01 GTGCAACACC AGGCTGTCTG CCTGTGGATT TCATTTCTGC CTTTTAGCTT TTACTTTTTC 

52861 TTTCTTTGGA GGCAGAAATT GGGCATAAGA CAATATGAGG GGTGGTCGCC TCACTTATTC 

52921 ACCCCCTTTG AGAATCTCAC TCATTAGTGG GAGTTCTCAC TTTTATTCTC ACTACCTATG 

52981 TCTTCTTGAA AGACAGATTG ATAATGATTC ATATAGTACA CTTGTGCTGA AGCATTTTGG 

53041 TGAGCTAAGG TAGTGATGAA GCTTTTTATC ATTTGGAGAA GTACAGGTAG CAAACAAGGA 

53101 AGCAGTAAGC AGGTTTCTAT TAATATTATA ACTCCTATTA TAAGAGTTTT AAATCTTCTT 
A ^ TCGGA ACCATTTTTG AAACATGGCC CCAGAAACAA ATCCATACCA CACCTACATG 

53221 GGCACATGTG CCACTTTTGT CATATTTCTA ACTATGTCTT CAACTACTTG CCCTTAATCA 

53281 TCTATGTGTA GACAGCAATT AGTAAGGTTA AATTTCCTAC AGACCCCTCC TTCAGTTGCT 

tllr^ AGCAAGTAGT CGAGAGCCAA TCCATTTTGA TAGATAGCAT TT TGCATCTG AGTTTCTTGC 

lilt, CAGGCCACAG TAGTCAGGGC TCTGCTGGTC TTATTAGTAA TTATTTCTAA GACAGCTTGT 

««i A JS GTATGA TTCAGTTGAG CATGTAAATG GGGGTCCCAT ATCCCCACAA GCCGTCTTGT 

53521 GCCCAAGTAG CAGGCCCATA ATATTGTATG ATTCTCTCAG GGGGCCATTC ATTATTTTTC 

53581 CAATTTTCTA TAGCTATGCT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTGCGG 

53641 GAAGCATATA CAGGGAAGCC CAGGAGTTTG CCTGTCTTTA TGGGCAGTAG GAAGAAAGAT 

53701 GGTTTAGTAG TGTCAATAAC ACAACTACCT GCCCACTGGT CAGGTAATTT GGCATAAGCT 

tllll ;j ATGCCCAC ATATCCAGTA TAATCCAGTG GGGGCTGTCC AGTCCCGGTG GGACTCTGGG 

53821 TGGGTCCACA CAGTTTGCAA CTTTGGGAAT TTACTAAATA GATTTTTCTT AGTGTGGTTT 

till, t^nl CCACr AGGTGGCTGT TTTTATAGTA CTATTATACA GTTTTTGCCC AAGGCAGCTG 

53941 AGTCTTCCCA CAGGAAGGGT GAAGTCCTTC CCCACTTTTG CTATACAGTA TTGTCTAATG 

ttn^ ATTGAGGCTT TTAGGACCCA GAAGTTATCA GGGTGAGTCT TTTGAGCTGG SSSSS 

54121 
54181 
54241 
54301 



— " ^*^vj x v- j. i i loavj^ibvi bAATTTATCA 

GGAACTGGGT CTGTAGGTAC TAATTCTCGT GCTTCCCATG GCCATTGATC TCCCATTACA 
GTTCCTCCAC ATACATACAT AACATGAAGT GACATTGAGA GACTGGGCTA CATGCTCAGC 
TAATTGCAAA AACAAATTTC TTGTTTTTCC TGGAATTTCT AGTACTGGCA CATTCAGTTC 
ATCATAAGAA GGTTTGAAAT ACTGGCTCAG GGGAGCATTT ATAAACTTCT CCTCAAACCA 
, CCATATTTAC TCAAGGATCC AGTCCAGCCC CAACTATTTC TAAGGTTACA CGATCCCCTT 

54361 TTTTCCAGTG AGAATCAAGG GGGTTGGTTA TTACTAGTTC TAAGGGGTTA CACTGACCAC 
lAll * GGTACAGGA AGGGCCACTT TTCCCTTTCT GAAGGTGGAC AGGATTCTTT ££££££ 
54481 ACCAAGTTGC CTAAATGACA CAAGACCAGT ATCTACATTT ATTTCCACGC AGTCTTAATT 
54541 CATGACAAGC GTACTTATTT TCTG CCATAT AGCCTCTTTC CTAATGAACA GAAcSct^ 
54601 CTATTTCTAA CTTATTACTA TTAATGACAG CACAGGCATC AAATTTCAAG GTCaSSS 
54661 TGGGCATTCC TTTTTCTTCT GTTTTGGCTA ACACTTTACT CGTATCGTTT ATGAACCCCC 
Hill ?^ CCTC AGTCCTCAAT CTTATTTCAA AAACTGTGGT CGTGGGAGGC SSSSSZ 
tllH CATAACACAC ATCAGGTTGG TCATTTCTTG GGCTACCTAC CTTGTATAGA ATAGCATTAT 
54841 ACAAACAAGT TATTTTTAGA GTCTTTGTAC ACTTATAATA ACCATAAAAT AATAAGACTG 
54901 TAGCAACTTT TTGTCCTACC TCAGTGACTT GATGTATACA CTGGGAACAG ££££££ 
54961 GAGGAAGGTT AGTTGAAGTC TTT ACTGTG C AAGTCCAAAT TTTAAGGAAA AtSSSS 
TGATGAGTTT TCTCATGTTT CGG CCATGCA TGGACCAGTC AGCTTCCGGG TGTGACTGGA 



"55021 



Figure 8 (Page 17 of 73) 



SUBSTITUTE SHPPTraill.E 9R\ 



WO 98/14466 



PCT/US97/17658 



33/162 



55081 

55141 

55201 

55261 

55321 

55381 

55441 

55501 

55561 

55621 

55681 

55741 

55801 

55861 

55921 

55981 

56041 

56101 

56161 

56221 

56281 

56341 

56401 

56461 

56521 

56581 

56641 

56701 

56761 

56821 

56881 

56941 

57001 

57061 

57121 

57181 

57241 

57301 

57361 

57421 

57481 

57541 

57601 

57661 

57721 

57781 

57841 

57901 

57961 

58021 

58081 

58141 

58201 

58261 



GCAGGGCTTG 
TCACAGTCTA 
TCCAATACCT 
GGTTTTAGGG 
TACTTGGTTA 
GAGGGCCTTT 
GGCTGTTGCT 
TTTGGAGGCC 
TTTCTCTTTC 
GGGCCGACAT 
TGTAGTTTAT 
CTGCAGCTAT 
CATCTAAATA 
TTATTTTTCC 
AAGTACAAGT 
ACCACACATC 
TAAGCTCACT 
GACAAGAAGT 
CAGGGTGCTG 
GTAGAATCCT 
AGTGGAAAGG 
TTGTTTAATG 
CCTGCTTAAT 
AATGTATGAT 
GGTC CACACA 
TCGTTGACAC 
GCTCAGTCTG 
CGACTTGGCA 
AAAATTGACT 
GAGTAATAGA 
CCTGGGACTA 
GTCCATCCCT 
CCTTCCTAGG 
GGCTGGTGCT 
TTGAGGGAAA 
GTTTTAAATG 
AATTTCCTTT 
ATATTTGACA 
GTGTGTTATT 
TTAATGTCTG 
GTTAAAGAAC 
TCACAGAAAA 
CTTTACAATT 
ATGTAGGTAA 
TACTTTCCTT 
GGCCTAATTA 
TTTTTTTCAT 
AACATCCCTT 
ATTCTT TTTT 
TAACCATTCT 
CAAGATTAGA 
GTTGAAAACC 
AGTCCAAATT 
GAACCATCTA 



TTGTCTTCTT 
CTGATGTTCA 
CTACTCAGTC 
TGTTGCAAAT 
ATCTAGCATT 
AAGTTTAGGT 
GCCAAGGCTC 
CAGCCTCGGC 
TGACACATAG 
GAGTTTTTCT 
CCAATCTACA 
TTGATTTTGG 
GATGTGAGAG 
TCCCTCTGGT 
GCCGCTCCAG 
CGCTTGGGGA 
GCATCCCTTC 
GAACTTAGTT 
GACTTTGGGA 
GGAAAACAGT 
GGATAATCTG 
TGTGGACAGA 
TATCAAAGTT 
TTTAGGAAAT 
ACATTCGACC 
TGGGGTCTTT 
AGGAGAGTCA 
AGTCCCCACA 
TGGTTATGTT 
ATAGATGAAG 
TGGCCCATGA 
TTTTCACCGT 
CTGGCTCAAG 
GGTTTACAGA 
GGAAAGTGGA 
TGGGGACATC 
AGCACCTATT 
ACGCTTCTTG 
AATGTTAAAC 
ACCATAAGGT 
AGGTTAGTGC 
ACTGTATGAT 
AAGGTTTCAA 
AAATC C AC AT 
ACATACCTTG 
CTTTTAAATT 
GACTTTCACA 
TCTTTAAACA 
ATATAAATTC 
TTTCCAAAGC 
AGTTACTATA 
TTGTAAGTTT 
AACTTAGAAT 
TCCTCCTGTC 



CAGTCACTTT 
AGGATGGTCT 
ACTTTCAACT 
TTCAATGGTT 
TGTTAGCCAA 
TTTGTCCAAG 
TTAAGCATGG 
CAGGGCCCCA 
AGTGTAAAGG 
TTTAACTCAT 
TTTTTATTAA 
GATTTAAATT 
TTGAAAGACA 
TGATGAAATG 
TTATTTGGCA 
TGAACAAAGG 
AGGTCTCCAA 
TTGGGAGATG 
TATAGCAGAG 
TACCATGCAG 
GCCCTCTGGC 
ATATTTGATC 
TGTTTTAAGT 
TACAAAAACC 
AACTATGGCA 
ATTGAAATCT 
GGAGGGACAG 
GGGTATAACA 
AATAACTAGA 
GAGTTAAATT 
CTCTGGAGGG 
ATGAACAACA 
TTTTCCTTCT 
AAATTCTAGG 
AGATAAACCA 
AGCAGTGGAC 
TTTATTAGTT 
TATGTTTATA 
TTAGTTTTAA 
AAGATTTTTA 
TTTAAGAAAA 
ACCCCTTAAC 
AACTTGCTTA 
TCTTATGCAT 
C AC ATAAAC T 
ATACAACATT 
GACAATTCTT 
ACTAGTTAAT 
TGCCTCCTCT 
GAACTTCTTT 
ATACATGTTA 
GGGATTTCAA 
TGGTATAGAT 
CTGAAGGGAG 



GCAGGCGTTG 
TGGAAGTTGG 
GGGCTTTCTG 
ATGCAGGGAT 
TGATGTATTT 
AGTTAGCTTA 
AGGCCAACCC 
CAGTCTGGGT 
GTTTTGTCAG 
GAAAAACTCA 
CTGTCACCCA 
GATCTG CTAT 
CATAAGGGTC 
CTAGGGTGAA 
GAGTGCC CAG 
CTGACTGATT 
GGAATGCTAA 
GAAGCTGGAT 
AGAGCTTGGC 
CCCATGCCTG 
CTGCCATGTG 
CATTCCAACT 
CTTTAACTTC 
GGTTGGGGCA 
TAAAAGCTCT 
CTCTGGAATA 
AGGTACTTTT 
AGGCAAGCAT 
TGGTCAGAAA 
TTTCTTAGCT 
GGTGGCACTT 
GTCTCGGTGG 
TTCCACCCTT 
GGTGGTACAT 
AGTATATAAC 
TTTATAGTCC 
TTTAGACCAA 
CCAGATAAGC 
TAAAACTCTG 
TAGACTTTTC 
ACCCGTTGTG 
TTTAGCCAAT 
AACCTTCAAA 
CCTCATAATC 
GTTTATTCAA 
TCTTACATAA 
CGACATGCCT 
TTATCTCAGG 
TTATTTCCTT 
TATGTCTGTG 
CACTGTTAAC 
TTATCCTTTG 
GGCTTTTTTT 
TTCCTCCTAG 



GCGAAGCTGC 
GCCCACTAGA 
ATACCAGGAG 
TTTCACATAG 
ATTAAAGTCA 
TCTGCCTCTT 
TTAGAAACTC 
CAAAACTCCA 
GTCAGGTAGC 
TTG CTGTTGG 
CCAAAATATT 
TCCCTGTGGG 
TTCTCTTGCT 
AGGGATAGCC 
TAAAGGTCCA 
GAGAAGCTCC 
GTTTCCTCCC 
GGCCCTCAGG 
ACGACTTATT 
GTCAACAGGA 
CACAAGCATA 
GGG CATTTGC 
TATGACCCTC 
GTCCATCCTT 
ACATCGGGGG 
AATGGTCTCA 
CTGAAGTACA 
TAAATTCAAT 
TAGAGTGAGG 
TTAGTTTGGT 
TCTTGACTCG 
TTAGCAGCAC 
TGATGAGAAC 
GTGCTAAAAG 
TTTTAAGAAG 
TTGGTGCCTT 
AGAAAGTCAA 
TAGATTTCAC 
TAGACATATT 
TTTAACCTTT 
TTTTTATTTT 
ATGTTTAGAC 
ACAATTTTTG 
CTTTTACCAA 
TAGTTTTACA 
ATTTATTTTT 
CAACTTTCTG 
ACAAGGATTT 
TTTTTTTTTT 
GACTAGACTG 
TTTTAGCAAA 
CTATTAATAA 
TTTTTTTAAT 
GTCTGGTCAG 



CACGTACAGC 
ATTAACTGAG 
CAAGGTGGCA 
CAAACTTTGG 
CCACAGCATG 
GTGCTAG CAG 
CATCTAGTTG 
ACCGCCATTT 
CCCAGGGCTG 
TTGTAATAGA 
GACTCAAATC 
ACTCCAATTG 
TTACGATGTC 
AATTGGACTA 
CCACAATACC 
TGAAAATTCT 
TGTCATGAGA 
GGTTGACCTG 
ACTCCAGGCT 
GGACCACCTT 
ACAATTGGTT 
ATC TTGGTAT 
TAGTAAAATG 
GCTCTTTAGT 
GCAAGACTCC 
GTTTACTAAG 
GAGATGTCTT 
AGTTTGAGGC 
GAAGAAGAAA 
AGGGTTTTCC 
GGTGTGATGA 
AAGGTAGGGT 
ATGATCTTCA 
ACTTTTAGTT 
TTGACCTTTT 
CTTACTGAGA 
ATGCCATTTT 
CTTTATATTG 
TATTTGATTT 
TATAATTTTT 
AATGTTCAGT 
ACAGAATTTT 
TAACCTTTTA 
AGGTATATTT 
TTTAGAAGGA 
CTAACACACA 
ACTTATTGCA 
TCCATACAAC 
CCGAGGATGA 
TCTAAGGCCA 
CTTTACTTTT 
GACCTTATTT 
TACCTGGGAG 
AGCTTTGTAT 
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GGTAATTAAG 

TTAATGTTAA 

GTGCTCACAA 

GTTGCCGCTA 

GATAGGAAGG 

GGTGTTCAGC 

GTTCCACCGG 

CTGGAGTTCC 

GACCGTCCGT 

CCTTTTAAGG 

AGCAGCGGGT 

AGGGATTTTC 

AGGTCAAAGG 

TTTCATTTCT 

GACAATATGA 

AGTAAATTGG 

GTAAAGATAT 

AAATAACCAC 

TCTCACTCCC 

TGTGGTCTCC 

CCAATATCTC 

TTCTCTATAT 

GGAAAATGGG 

TGAAAAACAA 

TGTTTGTGTC 

ACTGCTGTAA 

AATGCACAAG 

CGCGGTGGCT 

TTTGAGACCA 

GCCGGGCGTG 

CACTTAAACT 

TGGGTGACAG 

ATATATATAT 

CCAAAGCACT 

GCATTAACAA 

AATCCTCATA 

TCAAACATAT 

GAGGTGGAGG 

CCACATAGTT 

TTTGTGTATA 

TACTGCACAC 

AGAGATTGAA 

ACCCCGCCAC 

TGGCCAGGGG 

ATCATCTGAG 

TAAAAATATA 

AGCCTGAGGC 

TGCCACTGCA 

CCCACCAAAA 

CAGTGTTCAG 

TGGCTCAGGC 

TCAGGAGATC 

AAAATGAGCT 

GGAGGATCTC 



ATTTAGATCC 
ATCATCTTCT 
TGAGGTTTCC 
CAGATTGAAT 
CCTTAATGCT 
CACTGCGTTG 
GGCAATTGCC 
C CGTAGGG AT 
TAATCACCTC 
GCTTACAACT 
ACGTGACTGG 
ACAATGCTTT 
TCGATCTTTA 
CCCTTTTAAT 
GGGGTGGTCT 
CAAATATTAA 
TTCTGTGGGG 
TAGAGAC CCT 
CACCGTTACC 
CCTCCCCATA 
TCCCATATAT 
ATCCACATAT 
GAAGAGAGAA 
AAACCACACA 
AAAATTAAGA 
GGATGGTAGA 
GCCAAGAAGA 
CATGCCAGTA 
GGCTGGCCAA 
GTGGTGGGTG 
CAGGAGGCAG 
TGTGAGACTC 
ATATATATAT 
TGGTAATGAA 
ATTAGAGAGC 
CACTGCTGAT 
CCCATAAAGG 
TAAAATGAAG 
ACGTGGAAGA 
TTGTTCCTGG 
AGTAAATGGC 
GATTGTTCCC 
CCCGCTTCCC 
TTGTGGCTCA 
GTCAGAAGTT 
AAAAATTAGC 
AGGAGACTCA 
CTCCAGCCTG 
CTTTAAATCT 
GAAAGTCAGA 
CCTGTAATCC 
GAGACCAGTC 
GGGAGTGGTG 
TTGAACCCAG 



CCTGTTAGGA 
TTTTTCTTTT 
TGTAAAAGTT 
GCATTTGGGC 
TTTGGAATAT 
ATCCTCCACG 
TACCTGGGAG 
GCTCCACAGG 
TGTCTC CAAA 
CTAAGGGGGT 
GGCTGCATGC 
TCCATACAAT 
ACCAGACCCA 
TTTTACTTTT 
CCTCCCTTAA 
TAAAGTTATG 
AAAACATTTG 
AAAGTACCCA 
TGCCCAGAAG 
TGTCCACATA 
ACATATTTAT 
ACCTAACCCT 
GAAGTTATCA 
CAGAAAAAAA 
ATTCCGGTTC 
GAATTAAATG 
ACAAAACAGA 
ATCCCAGCGC 
CATTGTGAAA 
CCTATAATCC 
AGGTTGCAGT 
TGTCTCAAAA 
ATATGAAATA 
AGAAAGGTAA 
TGAATAATGC 
GGGAGTGCCC 
TGACAGGAAA 
TCACTGCACA 
ATCCGTAAGA 
CAGGTAGGCA 
CAGGCTGAGC 
TGGTCTGGGA 
ATCTTTCCTA 
CACCTGTAAT 
CCAGACCAGC 
AGGGCATGGT 
CTTGAAGCAC 
GGCAACAGAG 
ACCTATGGCC 
TGAATACCCT 
CAATCCTTCT 
TGGACAACAT 
GCGCGCACCT 
AAGG CGG AG A 



AACCTGCCGG 
TTCCTTAGGA 
ATTTTTTTAC 
CATCCGCGGG 
GCCCTGACAA 
AGGGCCTGCC 
CGCTCTCCAG 
GCAGGCCTAA 
AACCAGCTCC 
CTGCATGAGA 
ATCAGTAATC 
GTCTGGAATC 
GGGTGCGGTG 
TCTTTCTTTG 
TTTAAACAAA 
GCATAGAAAA 
TTCATTAGTT 
GGGGCTAATA 
GGAAGAGGAA 
TACCTGACCT 
CTGACCTCTC 
CTCACACACA 
AAGGATAAAT 
AAACACACAC 
AATGAAGGAT 
TCTGAATCAG 
AACTCCACAT 
TTTGGGAGGC 
CCCCATCTCT 
CAGCTACTTG 
GAGCTGAGAT 
AAAAAAAAAA 
AATGAACAAG 
AGTGATGTGT 
TCAGTATTGG 
ACTCCCTGGG 
GTGTGGGCTG 
ATATAGAGTT 
TACACACACA 
TGGAGGTTTA 
ACTGACTTCC 
CCCTGCAACT 
CCTGATTAGA 
CCCAGCACTT 
CTGGCCAACA 
GGCACACACC 
AGTGATGGAG 
TGACACTTTG 
AAATGCCTGC 
AAAATTAGAT 
TGGGAGGCCG 
GGTGAGACCG 
GTAGTCCCAG 
CTGCAGTGAG 



GTTAAGAGAA 

TACTTCTGAA 

TTTCTTCTGT 

TTACTGGGTT 

CAAAGTGCCA 

ACGTGCTGCT 

ATCTGTGTCG 

GTCGCCTAAG 

CTGAGTGAGC 

GGGT.CGTGAT 

AGAACAGAAC 

TATAGATAAC 

CCGGGCTGTT 

GAGGCAGAAA 

ATTTTCAAAG 

TAAAAATGAT 

ATCAGTTAAA 

ATAAGAAGGG 

GAGGGTGACT 

CCCCTCCCCA 

CACATATGTA 

TATAGCTGAC 

CTAGGTCATA 

AAAAAAGAAA 

CCCATGGATA 

ACGAAAGGAT 

AAAAAATGTA 

CAGGG CGGGC 

ACAAAAAATA 

GGAGGCTGAG 

CACACCATTG 

TTATATATAT 

AAATTTAGAT 

CCTTTTGCAT 

TGTGGATATG 

AATATTTTCC 

ACTGATATCC 

GGAAGCAATG 

CACACACACA 

GAGGCTTTCT 

ATGAAGGGAG 

GAATATGCAG 

ATAGCTTTTT 

TGGGAGGCTG 

TGGCGAAACC 

TGTCATCCCA 

GTTGAAGTTA 

TCTCAACAAC 

TAAAATGAGC 

GCAATGTTGG 

AGGCGACAGA 

TGTCTCTACA 

CTACTCAGGA 

CAGAGATCAT 



TTTTCAGTGG 
CCGGTGAGGT 
TAGCAAAGCA 
AAGGATTTTT 
GTTCCTTCCC 
CTGGTGAGGC 
CTCAAACTGG 
GGGCTGCCTT 
AATTCCTGTC 
TGATTGAGCA 
AGAACAGCAC 
ATAACCTGTT 
TGCCTGTGGA 
TTGGGCATAA 
TCCTACCCCA 
TGTAAAAGGC 
ATTCTGTGAA 
AGGAACACCC 
CCAGGAGAGC 
AAATATATAC 
TACCTAAACT 
CTCCAGTGGA 
CTCAGAAATG 
TTGATAAATT 
AAGTTAAGAC 
GAGTAATTAG 
TGAGGCCGGG 
CGATCAGGAG 
CAAAAAATTA 
GCAGGAGAAT 
CACTCCAGCC 
ATATATATAT 
ACAGGAAAAT 
TTAAAAGAGA 
GAGACTCAGG 
AAATATCATC 
TTCACTGAGA 
GATTAGATGT 
CACACACACC 
ACATCACACC 
ATTGAAGGTA 
AAAAAAGTAC 
CAGAAAACGT 
AGGCGGGCAG 
CCATCTCTAC 
GCTACTCGGG 
GCTGAGATCT 
AACAACAAAA 
ACCCAAGAAG 
CTGGTCACAG 
TCGCTTAAGC 
AAAACGTACA 
AGCTGAGGTG 
GCCACTACAC 
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61561 CCCAGCCTGG ATGATAGAGC CAGACCCCCA TCTCCAGAAA AAAAAAATAA AGAGAGAGAG 

61621 AGATGCAATA TTTAGGGTTC AACAAGACTG AATTTCTGAC TCCTTTCCCT ACCTCTCCAG 

61681 CATGTTAGAT TCTGGGTCCT TCATCCTAAC CCCCTGTTCA TGCCATAGCC ACCCTGTGGT 

61741 ACCAACTTTG GAAGCCTGGA TCTTCATCCC CTCATGATAA TGAGTGTCCC ATCAGGTCTC 

61801 CATGCTCAGC TTGGCAAGAG TATCTGTCTT CTCCTCATGG GACGGTCACA TTCACCCAGC 

61861 ACTGACAGGT TCCATTCCCA CTAGGGTGGC ACCCTATATG GTCTGAGTCC AGGCCTTCCT 

61921 GGTCCCTCAG TAATCTCAGC ATGGTAGCAC AATCGAAAAG GG CTAGGC AC GGCAGCACCA 

61981 TTTCCCACCA AGAGGTCTGA TGGCTCATCA CATAGACTGA AGGAGATTCT GAAGAGCAGA 

62041 GGTGGAATGA AGAATGAATC GTGGGCTCTG CTCTTCCTAG GCCTGTCTTC CTCTCTCCCG 

62101 AGATGTTAGC TAACTCATGA GAGCCAGAAA CCAACTGCAG GCTGGCCTCA GGCACTTAGG 

62161 TAGTGCTTCA GCCTCAGCAG TCCACATTCT AGGAACCCTC ATAATATGGG TTGAAGTATG 

62221 CATTCCCACA AAAATAAAGT TGTTGAAGTC CTAACCACCA GTACTGAAAT GGGAAAAGTT 

62281 CCCTTGTCCC GCTCGCATGG CATGTGATAG GAGTGTGGCT AATTTCTTCA GTGCCTGGCT 

62341 GCTCAAACCT CTAGGGGAAC ATTAAGACGG GCAGGTTGTG GGTCTCCAAC CCCATGACCC 

62401 CACCACAGTG TCTAGGGTTG AATGTTTACA GCTCCTGAAG CCACAGTGGG TGTGTGTTAC 

62461 AGGGTGCTCT TTTAGTTTTG CCATTTATAG GCAGCTGGTG TTAACCAACT C AATTAG AC C 

62521 GTCTACCTTG TCCCAAGGAC AGAAGAAGGC TTTCTGTATC CCAGGTTCTT GCCTTGGTGT 

62581 ACCGGAATAA ATCAGACCAC ACCTGGGCTT AGAGAAAGAG TGCAAGGTTT TATTAAGTGG 

62641 AGGTAGCTCT CAGCAGTTGG GCAAAGCCAA AAGTGGATGG AGTGGGAAAG TTTTCCCTTG 

62701 GAGTCAGCCA CTCAGTGGCC CAGGCTCTCC TCCAACCACC CCAGTCAAAT TCCGCCTCAT 

62761 TTTGCCAGGC AAACGTTTGT TGTGTGCTCT TCTGCCAGTG TGCTCCCCTG GACGTCCAGC 

62 821 TATTCGTGTC TTGTGGCAGG CCAGGGGAGG TCTTGGGAAA TGCAACATTT GGGCAGGAAA 
62881 AC AAAAATG C CTGTCCTCAC CGTGGTCCCT GGGCACAGGC CTGGGGGTGG AGCCCTAGCC 
62941 GGGGACCACG CCCTTCCCTT CCCCACTTCC ATATCATTTA AAGGGACCAT GCCCTTCCCT 

63 001 TCCCAGCACT TTCCCCCTCC TGTATCAGGA CCTGTGAATG TGGCCTTATT TGGAAATAGG 
63 061 GTCTTTGCAC TTCATCAGTT AAGATAAGAG TGGGCTCTAA CCCAACATAA AGGGTGTCCT 
63121 TATAAAAAGG AGAAATGTCA TACACAGAGA CTGACACCTA TAGAGAGAAA ATGTGGTGAG 
63181 TAGACACAGG GAGAATCACC ATTCAAGTCA AGCAATGAGT CTGGGGATAC CAGAAGCTGG 
63241 GAGAGAAACC TGGAACAGAT TATCCCTCAT TGCCTTCAGA AGGAATCAAA CCTGATGATA 
63301 CTTTGATTTC AGACTTCCAG CTTCCAGGAC TGTGTGACGA TAAATATCTG TTGTTAAGCC 
63361 AACGAGTTTG AGGTACTTTG TTACTGCAGC CCCAGAAAAC TAATACAGTA GGTACTATGG 
63421 ACTGAATTGA CTCCCCGTCG CAAAATTCAT ATGTTGAAAC CCTAACCCCC AGTGTGATGG 
63481 TACTTGGAGC TGGGG CGTTT GGGAAGTCAT TATATTTAGA CAAACTCATC AGGATGTGTC 
63 541 TCTCATGATG AAATTCATGC CCTTATTAAA AGAGACAACA GGCCAGGTGC AGTGGCTCAT 
636 01 GCCTGTAATC CCAGCACTTT GGGAGGCTGA GGTGGATGGA TCACCTGAGG TTGGGAGTTT 
63661 GAGACCAGCC TGGCCAACAT GGTAAAACCC CATGTCTACT AAAAATACAA AAATTGGCCA 
63 721 GGTGTGGTGG TGCACGCTTG TACTCCCAGC TACCTGGGAG GCTGAGGCAG GAGAATCCCT 
63 781 TGAAACCAGG AGGTGGAAGT TGCAGTGAGA TCACACCACT GTACTCTAGC CTGGGTGATA 
63841 GAGACTCCAT CTCAAAAAAA AAAAAAAAAA AGACAATAGA GCCAGGTGCT GCAGCTGATG 
63 901 CCTGTAATTC CAACACTATG AGAGGCTGAA GCAGGAGGCT CGCTTTAGCC CAGGAGTTCA 

63 961 AGACCAGCTT GGACAAAATA GTGAGAC CCC CAACTTCTAA AAATTTAAAA AATGAACTGG 

64 021 GTGTGGTGGT ACACATCTGA GGCTCCAGCT ACTCTGGAGG CTGAGGTGGG AGGATTGCTT 
64081 GAGCCCAGGA GGAGGCTGCA GTGAGCCATT GCTGTCCAGC CTGGGCTACA CGAGAACCTG 
64141 TCTCGGGAAA AGGAGAAAAC AGTGAGACCT CTTTTTCTCT CCTCCTTCTC TCCACTGCCT 
64201 AAGCCCTACA AGCACAAAAA GGACACCACA TGAGCACATA GTGAGAATGC TGCTGCCACC 
64261 AACAAGTCAG GAAGAGAGCG TTCACCTAGA AACTGAATTG GCCAGCACCT GGATCTTGGA 
64321 CTTCTGAGCT TCCAGAACTG TGAGAAAGTT ATTTTTTTTT TAGCGACTAA GTCTATAGTA 
64 381 TTTTATTACA GCAGCTCAAG GTAACTAACA TAGTAGAAGG GATGAATTAT GGAGATCACA 
64441 AGTCCACGCC TCCAGAAAAA GACTTCCCTA AAAATTAGTC TGAGCAAAAT TCGAATGATG 
64 501 AATTATTTTT AAGAACTTTT AAGGGATCTG ACAAGTTTGC AAGAGCTAGA GAATGCTTTA 
64 561 CAACGTGATA ATAGAATGCT CTGTGATGAC AGAAATCTTT CCACACTGTT CAAAACTAGC 
64621 TACTGGCCAC TTGTGACTAT TGTGCACTTG AAATGTGACT GGTGTCTGAG GAGCAGAATG 
64681 TTTAATTTTA CTTAATTTTA ATTCATTACA ATAGCTACAT GTAGCTAGGG GCTACTGGAT 
64741 TGAACAGCAC AGCTCGAGTC TTTTAGAGGG AGACAGGACT CACCAAGATG GATGCTGGTG 
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GTTAATGACC 
GTAAATATAT 
TTTTAAATTT 
TTAGAGTGAA 
CCTCCTGCCT 
AGCTAATCAC 
GCAACCCATT 
GAGGTGAGGA 
TTGAAATATG 
AAGCCAGGCA 
CCTGTATTGT 
GCCAGTGGGG 
TAAATCATTC 
AAGATGACAG 
TTTGGGAGGC 
CATAGTGAGA 
AAAGTGAAAG 
CACTATCCTT 
TGGACTTGAC 
AGTG TGGC AT 
GAATACACAA 
CTTGAGGACA 
GTCATTCTTT 
GCAATCTCAG 
TTCCAAGTAG 
GAAGAGACGG 
GCCCACCTCA 
GGTCATTCTA 
AATTTGAGAA 
TAGTCAAAAG 
GCTCTTAGGC 
TGACTGACAT 
T CATTAATG A 
ATATGTATTT 
GTTGTTTAAT 
AAGAATTCAT 
AATGTTTACT 
TCACGTAGGA 



AGTTTTACAA 
GGAACTTTTG 
TATGGTATAT 
TAAATAAATA 
TTAAATAATT 
ATTTTTTGGA 
AGAACCACTA 
TTGAAGGCAA 
AGTGGCTGTG 
TGTCCCACCA 
ATTTCTCCTG 
TAAAAAACTA 
TTCTTCCAAT 
ATATACGTTT 
TAAAAATGTG 
TCTGAAATGC 
CAGTTTCCTA 
TTTGCAATAT 
TATATTTATG 
GTGTTGTGAT 
CAGCCTCATG 
TATTTATTAT 
TTGAGGGCGT 
AATATTTAGC 
AGCAAAACTG 
ACAGACAAGA 
TGGTCTCAAG 
TGGCTGTTAT 
AAAGTTCTGG 
AACCCTACTC 
TGAGGCAGGA 
CCCCTGTCTC 
AAAAACCAGC 
TGAATATATC 
ATTTTGCTCT 
GAAACAAAAC 
TATATTGCAT 
TCAGTTTTGG 
TTTTTTTGAA 
CTCACTGCAA 
CTGGGATTAC 
AATTTCACCA 
GCCTCCTAAA 
ATAGACTTTT 
ATACAGTTTC 
CTTTGTATCT 
GAAATTGAAC 
ATATGTACTA 
AGGAGGAAGC 
GGTCAGGGAA 
GACTTATTAG 
TTCCATCTCT 
TTTAGATAGT 
CAGTGTTGTC 
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74 521 CAATAAGGTA CCACTAGCTA CACGTGATCA TTGACCATTT GGACTATAGC TAGACTGATT 

74 581 TAAAATGTTC TAAAAGTGTA AAATACACAC CAGGTTCTGA AGATTTATCA TTTAAAAAAG 

74641 AATGTCAACT GTCTTTTTTT TTAGCTTATT TATTATATGT TGAAGTGATA ATAGTTTAGA 

74 7 01 TATATTAAGT TAAATAAAAT ATCTTAAAAT TAATTTTACT TGTTTCTTTT CATTCTTTCA 

74 761 ATGTGACCAC TAGAAATCTG GAAAGTATTT ATGTGATTCA CATTCTATTT TACTGTCTAG 

74 821 TATTGCCTTA CATCATCAGG TACCCCATAA GTAGGCTTTT TAGATAATTC TCTAATATAG 

74 881 CTTGGAAGGA TATGGAGAAA TATTTTTGCG TTGCTTTTAA GTTTTGCATA ACTTTTTCAA 

74 941 CACACTTTAT AAAGGATCTA GAAAAGGGTT GGTTACATGT TTCTCTGTCT TCTGGCCTCC 

7 5 001 ACCATGTTGC CAGGAGGTTG GGGACAAGAT TCTGGGTGGC TGGATGTCCT AATGGCTTGA 

7 5061 GGTCTGGACT TGAGATTTGC ATATAAAGAG ATGTGATTAG ATTGAGTCGA CTAGAAAAAT 

7 5121 CATATTAGAG AACTGAATCA CAGCGATTAA ATTTACATGT CGATTTATAA ACCAGGACAC 

7 5181 CAATTTATAG TGAAAGAAGG TCCAGTTACC TGGTAATCAA GACGTTTCAT AGCTATTTTC 

7 5241 ATGATGGATA TACTTAGCTG AGTTTTAAAT GAGAAGGGGG TTCATTGCAC ATAGAATAAG 

7 5301 ATCTAAGTGA AATGTTTATT TTATTTTTTT TTTTTTGACA TGGAGTCTTG CTCTGTTGCC 

7 5361 CAGGCTGGAG TGCAATGAGG CAATCTCGGC TTCTGGAGTG CAATGAGGCA ATCTCGGCTT 

7 5421 CTGGAGTGCA ACGAGGCAAT CTCGGCTCAC TGCAACCTCC ACCTCCCGGG TTCAAATGAT 

7 5481 TCTCCTGCCT CAGTTTCCTG AGTAGCTGGG ATTAGAGTTG CCTGCCACCA CGCCAGGCTA 

7 5541 ATTTTTGTAT TTTTTTTAGT AGAGATGGGG TTTCACCATG CTGGCCAGGC TGGTCTCGAA 

7 5601 CTCCTGACCT CAGGCGATCT GCCCGCCTCA GCCTCCCAAA GTGCTAGGAT TACAGGCGTG 

7 5661 AGCCACCAAG CCTGGCCTAA GTGACATGTT CTTATATTGT TCCTTTCTTT CTTTTTTTTT 

7 5721 CGACTGAGTC TCACCCTGTT GCACAGGCTG GAGTG CAGTG GCGTCATTTC GGCTCATTGC 

75781 AACCTCTGCT TCCCGGGTTC AAGCGATTCC CTTGCCTCAG CCTCCTGAGT GCCACCACCC 

7 5841 C CAG CTAATT TTTGTACTTT TAGTAGAGAT GGTGTTTCAC CATGTCGGCT AGGCTGATCT 

7 5901 CAAACTCCTG GCCTCAGGTG ATCCGCCCCC GAGTCTCCCA AAGTGCTAGG ATTACAGGCG 

7 5961 TGGGCCACGG GGCCCAGCCT TATATTATTT CTTTTACTAC AATATATTAG TATGATGCAG 

7 6021 GTGCTTCAAT TGTTTATACA CTTTCCATAA TTTTGTATAA TTCTTATACC CTGTCACTCT 

7 6081 GAGGAATAGC CGGTCTAAGT GTTTTTC C AC CACTGCTAAT TCATCCATCA CTAATCTCAT 

7 6141 TAGACTGTTA ATTCCCAGAG GACATAAGCA CACAAGCAGA CAATGTTTAC AAATGT TGGA 

76201 CAAATGTTAT TTAATAAAAC AATGGGGTCA CCCTTAGTCT AAAAGATGTT TCACTTTTCA 

76261 TTTGTCATTG AACTCTTATT TGTAGGTTCC CTTTTGACTT TCCCACAATC TAAGGCTGTT 

76321 CTCTTTAACA CATATTTTCA TGAAAACATA TATTTGAGCA GAAATTGTTG GGGAGTTGTA 

76381 ATATTACCTT TGTCCCTAAA TATGAATCTA TAATTATATC AAATATATGG GCAGACAATT 

76441 TACTTTGCCT TTAATCTCAA GAAAAAAATA GCAATTACTT GGGGTCGGAG AGTAAAATAA 

76501 GAAGTAGTGA ACCTTAAAGT AGCAAACTTT AGAACAGAAT AGTTTCAGAG GGGATGAGAA 

7 6 561 GAGGTGATTT TTCAG CTC AT CAACAACAGA TCTTATAATA AATTACATGT TCTGGTACTT 

76621 TTCTTGTCTT TCTGTGTTAA ATTTTGCTAT TTAAAAAAAT AAATTTCAAA TACATTGTTC 

76681 ATCTTAAAAG TCAAGAGTGT GTTTTATTAA AGTCAGTTGC TTTATTTGCA ACTCAAAAGA 

76741 TATATTTGAG TTCCCAACTG GAGATTGTCC TATATGGTAA CTTG CGTAAG GTATGGTTAC 

76801 TGAAAGTAAC CTACAATTTT CATGGGCTGA AATTCATTTC TATATTGCAG CGTACAAAAA 

76861 TAAATAAATA AAAAATG CTT GTTTTCTTTG AAAACATATT ATCTCAGTGC CTCTAACTGC 

76 921 CAAATCTATT GGCTTTTTTG CAGGCTTAAG GGCTCTCCCT TGTTCCTTTA TGATCTCTAT 

76 981 CTTGAGGGCC AGACCTCCTG CCTTACACAA CTCAGAGGGG GACCTCAGAG CTCTTTAAAA 

7 7 041 AGAGCCCAAT TTCTCGCCTG TAGAGAAGTG AAAAGGATGC CCCACCCCCA TCTATGAAAA 

7 7101 GAGGGATTTG ATAGTTTCAA TGTCTTCAAA TCAAAGATTT AAGTCTGTAG CCCCCCACCA 

7 7161 CCCCGGACCC TAG CAAGG CT CATGAACCCC CTCCCATCCC GCCCTAATTG CTTTGGACTG 

77221 GCCGTGGAAT CCTTGTCCCA GTCCACAGTT CCTGTGCGAC TGCACGAAGA ATTCACAGAG 

77281 GACCTGTGTT ACTTCCCTTG TGAAGAAACA GAATTATCAT GAAAATTTAG GTGGAAACCA 

7 7341 TTTCGCTTTT TTCTTCAAAA ATAAGGGAAG CATGTGCCCA ACCACCCCTG GGAAAAAGAA 

7 7401 CCTTCAGGGG CAAAGGAGCG AACAGGTAAT TTATAAGAAA AACAGAAAGT GGTCTCTGAC 

7 7461 TGCCCCAGAC TTCCTTCGGA GTTGGGGGAA TTGGGGACGC CTGGACGCGT TGTTTTTGTG 

77521 TTTGTGGAAA AAATAAATGA AGAGCATGAA GCCCGAGGCT TCTGAGATCC TTTCCTGACC 

77581 AAACCCAAGT GATTTGGTGC GGGGAATTTT AATATTTTTC CCCTTTTGTG AGGTGGAACA 

7 7641 AACACAACTT GGGAGCAGCG CAGCGGCTCA GAGCCTGCCA GCCAGGCGGG CGACCAGAGC 

77701 ACCAATCAGA GCGCGCCTGC GCTCTATATA TACAGCGGCC CTGCCCAGGC GCTGCTTCAT 
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CGGCGCTTTG 

CGCTCCCGCT 

GGCTGGGGGT 

TGTGGCCGCC 

TGCCGCCGGC 

GGTGAGCAAG 

CAACAAGAAG 

ACCTAAGAAG 

GAAGAAGAGC 

CAAGAAAGTG 

AAGTGCTGCT 

GGCGGCGCCC 

GAGCCACCAC 

TTCTGCCCTG 

TGACGGGGTT 

TAGGTCCCTG 

CAGGCCTGTG 

TGTCCGGATT 

TTCTAGTACA 

TGTTTTTTGC 

GGATTTCGGA 

AACATCCAGC 

TAATTTCTCA 

ATAGCCCTTC 

TGTAGCCTTC 

AACTTTTAGC 

CAGCAACATT 

CTATGCAGAA 

ATTTCTAATA 

TACAGAAAGC 

AGACCTAAAT 

C C ACAATG AT 

GAAAGAGAAA 

GAAAACATTC 

TTAACATGGT 

G AC CATTTAT 

TCCTATCAAT 

ACTGTCTTAG 

GTAGAATTGT 

GTGACAAATG 

CTACTTCACA 

TCACATCATC 

TATTTATATT 

GCTGTGGCAC 

CCTCCGCCTC 

TATTTTTAGT 

CAGGTTATCC 

CACAGACTCA 

ATTGCTAATC 

AGAAAAAAAC 

TGCAGTTTAT 

TGTTGTAACG 

AGAATTTTCC 

CTCCTCCTGT 



CCACTTGTAC 

GCCGCGCCTC 

ACGCCTCGTA 

TCTAAAGAGC 

TATGATGTGG 

GGCACTCTGG 

GCAGCCTCCG 

CCAGTTGGGG 

GCTAAGAAAA 

GCTAAGAGCC 

AAGGCTGTGA 

AAGAAGAAAT 

TGATCTCAAT 

TTACTTAAGG 

GGAGAGTGGC 

ACCGGAGGCT 

AACGGCAGAA 

GGACTAAAAA 

TGACTTTCAT 

TTTACTGTGA 

CGCTTTCCAT 

CCTGGGAGGA 

TTCCTGTGGC 

CTCCACCCAA 

CGCTAAACTG 

CAGCCATTTT 

TAAAAATCGA 

AACAGTATTT 

AAGGCCTTCG 

CTAGCGTCTT 

TATGGGGACT 

TAATATAGTG 

GGGGAGGGAG 

TCCCATGGTT 

GAACCCTCTA 

GAACTTTCAT 

TTTGGCTGTT 

GTTTCTCAGG 

ATTGTTTTAA 

GCAAGTGTTC 

ATGCCTACTC 

ACAAGTAAAA 

TATTTATTTA 

GTTCTCGGCT 

CCGAGTAGCT 

AGAGACGGGG 

GCCCACCTCA 

AATCATTTTT 

TCTTACAGTG 

AGTGTATATA 

TAAACATGCA 

TGACTTTAAT 

TGGTTATTCC 

TTCTCCATCT 



CCGAGTTTTT 

CTGCGGAGAA 

AGGCGTCTGG 

GTAGCGGAGT 

AGAAAAACAA 

TGCAAACGAA 

GGGAAGCCAA 

CAG CCAAGAA 

CACCGAAGAA 

CAAAGAAGGC 

AGCCCAAGGC 

AGGCGAACGC 

AAAAGAGCTG 

TTAGTCGTAT 

CGTGGTGAGG 

TTTCTCGCTG 

AAGACCGCAA 

ATTTTCAAAA 

TCTGTATTTA 

CTTAAAAGTT 

GTTGTTGGTA 

GAGTGCGTGC 

AACGAAGGAA 

GGCAATCGTG 

ACAGGTTTGA 

GTCCTCGCAT 

AGTTCCTTTA 

GTACTATTAA 

TTAATGGTTC 

ATATTCGCTT 

GGGGCTTCTG 

AGTTGATTTG 

GCAAGCAGAG 

TAAGTAATTT 

TTTTGGTGTA 

TTCTGCTTCC 

TTGTCATAGG 

TTTCTATTTT 

ACATTGTGTT 

AACTAATACC 

CATTCACCTC 

CGGTAAGCTA 

TTTATGAGAC 

CACTGCAACC 

GAGATTACAG 

TTTCACTAAG 

TCCTGCCAAA 

ATTACAGTAT 

CCTGATTTAT 

CGGTTCAGTA 

TTTACATTAG 

AGCAGATAGA 

ATTTTTTATT 

CAACATCAAA 



GATTCTCAAC 

GGCCCCTGTA 

TCCCCCGGTG 

TTCTCTGGCT 

CAGCCGTATC 

AGGCACCGGT 

GCCCAAGGTT 

GCCCAAGAAG 

AGCGAAGAAG 

CAAGGTTGCG 

CGCTAAGCCC 

CTACTTCTAA 

GATAATTTCT 

GGGAGTTACT 

TTACAGCATT 

GCGGATGGTT 

AACAAGAGCC 

GTCCCGCCCT 

ATTGGATGGT 

TTGCCTCTTT 

GTCAAGTTGA 

AGGTACCTTT 

TGCATTTAAA 

GACCTAGGGA 

G CGTATCG AT 

GACTACGGTT 

AACGTATTTT 

CTATGAAGAG 

CCTCTGTTTG 

CTTTTAAAAT 

GAGATAAGCT 

TTAGTGATAG 

AGACAGGAAG 

TGTGTTGTTA 

AGGTTTAACA 

CCCTTCTTCC 

CTAATACGCT 

GTTCCTTTAG 

GTGTGCTATC 

TAAATCTGTA 

ACTTTATCTC 

TTTTGAGAGA 

GGAGTTTCCC 

TCCGCCTCAC 

GGGCCTGCCA 

TTGGCCAGGC 

GTGCTTAGAT 

ATTGTTATAA 

AAATTAAATT 

CTATCTGTGG 

TCTCCCCTTT 

GCTAATTTTC 

TTTCCATATG 

CAATTAAAAA 



ATGTCCGAGA 

AAGAAGAAGG 

TCAGAGCTCA 

GCTCTGAAAA 

AAACTTGGTC 

GCTTCTGGCT 

AAAAAGGCGG 

GCGGCTGGCG 

CCGGCCGCGG 

AAGCCCAAGA 

AAGGTTGTCA 

AACCCAAAAG 

TTACTATCTG 

GAGGTATCAG 

TAAACCTTTA 

TTGGGATGGC 

AGTTTCTTAG 

GCTCCCGGGT 

GGAAGACGTT 

TCTCTTTATA 

TGTCTCCTGG 

GTCCTACATT 

AAACAGC CAC 

GTTTTTTGTG 

TTTGAGCGTA 

GCTTATCCTG 

GTTTGGCAGT 

TGTATGGATA 

ACATCCATGG 

CTGGTGGGCA 

GCTCAATTAT 

TGACCACGGA 

ACAGAGGCAG 

ATTTTACATT 

TATGGACATA 

TCCCGTGCCA 

ATAATTTCAT 

TCATTCCCAC 

CTCAATGCTG 

GTATCTTATC 

ATTACTGGCA 

GATCACAGTC 

TCTGTCACCC 

GGGTTCAAGC 

CCATGCCCGG 

TGGTCTCGAA 

TACAGGCGTG 

TTGTTGTTTT 

CATCATTGCC 

TTTCAGGCAT 

GGGAGACTAA 

TCTCATTACT 

TATATTAAGA 

AAAAAAAAAG 



CTGCTCCTGC 

CGGCCAAAAA 

TCACCAAGGC 

AAGCGTTGGC 

TCAAGAGCCT 

CCTTTAAACT 

GCGGAACCAA 

GCGCAACTCC 

CCACTGTAAC 

AAGCTGCCAA 

AGCCTAAGAA 

GCTCTTTTCA 

CCTTTTCTTG 

ACGAATTGGG 

TTGCGGCTTC 

AGTCCCGCCC 

TCTAAAGGGA 

TGGTCCGTTC 

GCTTATTCTG 

TTAATGTCTG 

AGGTAGTGGC 

CCTCTGCTGT 

AAC AG CGGCA 

CCACATAACA 

TCGAAAGCAC 

TTTAGACAGA 

CCAAATGTTT 

AATGGGAGAC 

TGCTTCTGAA 

CATTTTGGTG 

TCTACCATCT 

TTCATCCCAA 

GGAAGAAGGA 

ACAACACGGT 

TTTTTCCCAA 

CCCTCCACGC 

GGACAGTTGG 

AATT CTTAAG 

AGATGATTAT 

AAGCCTAATG 

TTCTGTCATC 

ATATAATTTA 

AGGCTGGAGT 

GATTCTCCTG 

CTAATTTTTG 

CTCCTGACCT 

AACCACCGTT 

ATT AT C AGTT 

ATGTGTATAT 

CCACTGGGGG 

TTAACTGAGA 

CTTCTTTTTC 

TCTCTTCCAC 

GCTGGGCGCG 
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GTGGCTCACG 

GGAGTTCAAG 

TTAGCCAACC 

ATTGCTTGAA 

TGACACAGCG 

TTCCAGCACT 

CCTGGCCATG 

TCACACACCT 

GAGGCAGAGG 

AGACTCTCTC 

GTTCCTTTCT 

CAAGAGTTAG 

TCAATGCCCT 

GAATGTCTTC 

AATTTATAGA 

TGCATATGTT 

TTCACATAGC 

TTTTAAGATG 

ACACATGTTT 

AGGACAGATT 

AGTTCCCAAA 

ACCATTCCAA 

CAAGGAAATA 

TTGTAGTAAG 

AGAATTGCTC 

TCACCTTCAC 

GATTTTCAAT 

GACATCCTGA 

GATTTTTAAA 

CAGAAACACT 

AAC C ATG AAA 

GAGGCTACTT 

TTGAGACATA 

TGCAGCCTCC 

ATTACAGGCA 

TTCGCCATGT 

CTCCCAAAGT 

TCATACACAG 

GCTCTGGATA 

ATTCATACCA 

GAAGAGAATA 

AGCACTTATT 

CATTCTTGTC 

CTAAAGCTTG 

GTGCATTTTT 

TAAACTTTGT 

GTTAGCCCTT 

TTTTGACTTT 

GGGCAGTGGC 

TGCCTCAGCC 

TGTATTTTTA 

CTCATGATCC 

CCCGGCCTTA 

CAACATTCTT 



CCTATAATCC 
ACCAGCCTCG 
ATGGTGGCAG 
CCTGGGAGGC 
AGACTCCGTC 
TTGGGAGGCT 
AAAATACAGC 
GTAATCCTAG 
TTGCAGTGAG 
TCAAAAAACT 
CCCTTAGATA 
TCAGGAGAGG 
TTGGGGTCTT 
TGCAAACATG 
GTTAAAAATT 
TTCTCAATTT 
TTACTGGCTT 
AAAAAGATTC 
TTCTGTACTC 
AACATGCGAA 
GAAAAAAAAA 
CAAAGGAAAG 
AATACATGGG 
GTTTGTTTTT 
TCTTTTTCCT 
AAAG GGAAAT 
TGCCTTCAGC 
TATTCTTCAA 
TTAGTTTTAT 
GCTGATAAGC 
ATTTATGACA 
GTAATGCATT 
GTCTCTCTCT 
ACTTCCCCGG 
CCTGACACCA 
TTGCCAGGCT 
G CTAGG ATT A 
TGCTATCATG 
TTTTGGCTAT 
GAGATGAACA 
GGGATTTAGG 
TACAATATG C 
ACAGCACTTT 
GTGTCATTAA 
TTTTAAATTT 
TCCTCTAAAG 
CTTAATAGAA 



GTGATCTCGG 
TC CTTAGT AG 
TTAGAGACAG 
GCCTGCCTTG 
TTTTGCCTTC 
CACCAAAAAT 



CAGCTCTTTG 

CCAAGATGGT 

GCGCCTGTAA 

GGAGGTTGCA 

ATAAAAAAAA 

GAGTCAGGCA 

CTGGCCATGA 

CTACTCGGGA 

TTAAGATGAC 

AAATAAATAA 

CTTTCATGGC 

AATCAACCCA 

AATCCATTTG 

TTTCCACAGA 

AGAAAAATTT 

TGTTCATGCT 

AGGTCTAATG 

TTGCCTCAAT 

TTAGATTCAC 

AAAAAGAGCA 

TTGAAACCTT 

GGAGTTTGCA 

CAATAAAAAC 

GCAGAGTCAT 

GGTATAGCAG 

TTGGGTAAAG 

TGAAAATAAC 

AACTTATATT 

AAAATAATTT 

CAAAAACATC 

TTGTTCTTGT 

ATTCCAAACT 

GTCACCCAGG 

TTCAAGCAAT 

AACCCGGCTA 

AGTCTCGAAC 

CAGGCGTGAG 

GCTACAAATT 

ATAAGCCTGA 

GGCCCAGTGC 

GTACAGTGGC 

CAAGCACTGT 

GAAGTAAGTG 

GGATGTAGCT 

AAAGTCAATA 

AGCTGGAGTC 

CTGATGCTTA 

TTTTTTTGAG 

CTCGCTGCAA 

CTGGGAC C AC 

GGTTTCACTA 

GCCTCTCAAA 

TTTAATCTCC 

CTTTGGGATT 



GGAGGCCTAG 

GAAATCCCGT 

TCCCGGCTAC 

GTGAGGCGAG 

AAAGCCGGAA 

GATTACCTGA 

AAACACACAA 

GGCTGAGACA 

GCCACTGCAC 

AAATAAAGTT 

TACCCATTTA 

AG C AAAAAT A 

ATTTATGTAC 

TGAAACTCGT 

TCAATTCTAT 

CTTTAGTTTT 

AACCATTCAT 

TTTACTTAGT 

TAAGTAGTGT 

TGCAATTTTA 

AAAAACGCGG 

CTTCATGGGA 

CATGGAAGAT 

CTCAGTGCCA 

TTGGGGACAC 

AGAAGACAGA 

TTTTATGCCA 

TAATTTCACA 

TGAAAAACGG 

AATGAATATT 

GTGATAAAAC 

TTCTGTTTTT 

TTGGAGTG C A 

TCTCCTGCCT 

ATTTTTTTGT 

TCCTGACCTC 

CCACCATGCC 

GAAGTATCAT 

GGGAAATGTA 

AAGACAGAAT 

AACAACAGTT 

TGCTGATTAC 

CCATTGTCAT 

AGTTAGCTGT 

AATTTTTATT 

AAAATGTATC 

ATCCACAGTT 

ACGGAGTCTC 

CCTCTGCCTC 

AGGCGCATGC 

TGTTGGCCAG 

GTG CTGGG AT 

ATTTGAACAT 

TAATTTCTTC 



GCGGGTGGAT 

CTCTACTAAA 

TCGGGAGGCT 

ACCTTGCACT 

GCAGTGGCTC 

GGTCAGGAGT 

TAAATTAGCT 

GGAGAATCAC 

TCCATCTGGG 

ATGGTACATT 

ATTGATGTTC 

GCTGATTTTC 

TTTCAATTAA 

CAAATGAAAC 

TTGGCCTTTA 

GTTTTATTCC 

TTGGAAATTA 

TTTTGAAACT 

CTTGCAAATT 

TTAGTATATT 

TTAGACTCAC 

TGACGAATTT 

AAAATGAAAG 

ACCTTCCATA 

TTTTACAAGG 

GACCTCTTCC 

AAGTAGAATA 

TTAGTAATTA 

TAATAATATT 

G C AT AAAC AA 

TATGAGTAAC 

TATTTATTTA 

ATGGCGTGAT 

CAGCCTCCTG 

ATTTTTAGTA 

AGTGATCCAC 

CGGCGCATTA 

ATTATACACT 

GTAAGGACAT 

TACATCACTA 

TTGGGAACTA 

TCTATATTTA 

TCCCACTTCA 

GTGTGTGTGT 

TGAAGAATTT 

TTCAAAAGAT 

GTCAGCCCAC 

TCACTGTCAC 

CCGGGTTCAA 

CATCGTGCTC 

GCTGATCTCA 

TACAGGTGTG 

ACACATACTG 

AACCACTTTA 



CACGAGGTCA 

AGTATAAAAA 

GAGGCAGAGA 

CCAGCCTGGG 

ACGCCTGTAA 

TCAGGACCAG 

GGGCGTGGTG 

TTGAACCCAG 

CGACAGAGCC 

GAACTTCTGT 

TTATCATCTC 

TAATTTTCCT 

TCCTAACCTC 

ACATTCCTTT 

GATTCAGTCT 

ATCACAATTG 

AAATTGGCCA 

GTCAATGAGG 

TAACTGACAA 

ACATGCACAG 

AGACTTATAC 

GGGAATGTGA 

ATAGAAATAA 

TCTAGTGATA 

GAAATTTCTG 

TACACCTGTT 

ATTTGGGGGT 

TATCATTTTT 

CAAATAATTC 

CTGATAATTC 

ATAAAAACTA 

TTTATTTATT 

CTTGGTTCAC 

AGTAACTGGG 

GAGACGGGGT 

CTACCTCGGC 

TTCCAAACTT 

CCTAGGCAAA 

TGTGGTTGAA 

AAGGATATCA 

GCATTTTTTG 

TTTTCAAACA 

GGGTGAAGGA 

GTGTGTGTGT 

CACATCAAGG 

TCATCTTCAA 

AGTTCTTTTA 

CCAGGCTGCT 

GTGATTCTCC 

GGCTAATTTT 

AACTCCTGAC 

AGCCACTGCA 

ATGAAAACTA 

CTTTGGGGTC 
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ATTTTAAGAT 

AATGAGTTCC 

G AATATG CCA 

TTTTGATCTG 

AGGGTCCACA 

AATCTATTTT 

ATTAAAATAT 

TTACACTTAA 

ATGGTTAGTG 

ATACTTGCTC 

CTGAAACCCA 

CCCAACGGGC 

GCAATAAGGA 

TGAGAATTTG 

GAGTGCTGCT 

TAAATCAGTT 

ATATGGGTGG 

TCTTAGGTTT 

GCTTCCAGCT 

TGCAAAAGCA 

TTTTTGTTTA 

CAGGAATGAA 

CTTTCACTGT 

CACCTCACAT 

TGTCTCCTTA 

GTATGAAACT 

TGATTAGAAG 

TTGGCGAATT 

AGATAATTTG 

T C ATGTGGT A 

TAATGAATCC 

AATATTCAAT 

CCACTGAAGT 

TTCTGGTATT 

CCCAGACCAG 

GTAGGATGCT 

AAGCTAATGA 

AATGTGTTAG 

TTAATCACGG 

ATTGTC TGAT 

AGTTGAAGAT 

GTACTTCATT 

TGGCTCACGC 

GAGATCGAGA 

TTAGCCGGGC 

AATGGCATGA 

ACCTGGGAGA 

ATGTTCCTAC 

AAAATGTTAT 

TGGAGAGTAG 

AACTTACATG 

CTTACTTGAC 

ACATATTATC 

AACATTACAA 



TAGGTGTATC 

AATCATATTT 

AAAGTTAAGG 

TGAGTGAATA 

TATGTAATTT 

AATGATTTGA 

TGAGTTTTTA 

AGCACATCAC 

GCAACTATCT 

TGCAGCAGAA 

GGTAGTGTCT 

TCTCCTTGTC 

GCCAGCGCTA 

GGGACCAAAG 

TGGTTGGGTC 

CCTGGGAGTG 

TGCCAGCTAA 

TAAAATAGTG 

GCATGACTCC 

GTCTGGTCCC 

AAAGCAAAAG 

AAGGACAGCT 

AATAATTTTC 

CAGGCCTCTG 

TCCACCCTGA 

ATATATGAGA 

ATATTAAAGT 

TAGTGAAATT 

CCTTACAATG 

CTTGGCCCGT 

TTTGTTTCAT 

ACAAATAAAG 

GTTCAAATTG 

TCTTCTGAGA 

TAATTCTCAA 

GAAGAAGGCC 

AACACAAGTG 

TTTGTCTCTC 

ATGGTTCAGG 

TATGTTAGAA 

GTATCTAGTA 

CGTTGCCAGC 

CTGTAATCCT 

CCATCTTGGC 

GTGTTGGCGG 

AC CTGGG AGG 

CACAGCGAGA 

TGCTCACTGG 

GACATCTAAG 

AAACGTATAG 

TGAAAGGAGC 

ATTACCAATA 

ATCAGCCACC 

AATAATTTCG 



TGCCTGGTTC 

ATTCCTAAGC 

TGAAAAATTA 

TAACTATCCT 

TAAATTTTTT 

ATCCAGTGTA 

CTTTGTTATT 

AGTTTGGAGT 

TGGACAGGAC 

TCCAGATGTT 

CTAATACTTT 

CACTTCCTAG 

CAGGAGACTA 

TTTTTAAGGA 

AGAGATGAAA 

GTGGGGTGGG 

TCCATTGTGT 

ATTTTATCCC 

TAAACCATAA 

CAGGCAGGAA 

TATAAACTAA 

TGGAGTTTAG 

TCAGTTATGA 

ACTAGAGGAT 

GGGAGTCCAA 

AGGAAATTAT 

GTGACACTGC 

CCTGAGGCTG 

CTGAAGTAAG 

GGAAGACTAT 

TGTTATTTCC 

TTAAAACAGC 

CTTAAGGTTG 

ACAGCACCAC 

CTCACAGGGT 

ACGTAAAATT 

TAAGGGCCTG 

TCTCTCTCTC 

CTGCTATTTT 

TCCTGATGAA 

TGGGGATAAT 

CAATCTGACG 

AGCACTTTGG 

TAACACGGTG 

GCGCCTGTAG 

CGGAGCTTGC 

CTCCGTCTCA 

AATAACTCAC 

TATTCAAAAC 

AGCCAGAAGC 

TTAACAGAGG 

ATGTGTTTTG 

CTGGAGGAAA 

ATCTGAAGAT 



TCAATTTGAC 

TATCACACTC 

AATTATTAGG 

CTATGTCCTG 

AATAGGCACA 

ACCAAAAATT 

TTACTAGTTC 

AGCCACATTT 

AGCTTTTATA 

TTCCAAGAAA 

ATATTTTATT 

ACAGAGCTGA 

GAGTTTTATT 

TAATTTGATT 

TTATAGGGAG 

GGACTCAAGA 

TCAGGGTCTG 

CAGGAGCAAT 

TTTATAATCT 

AGGGGTTTGT 

GCTCCTCCCA 

ACGTTAGATG 

TTTTTGCAAA 

TCCAACAATA 

TTTCTGAAAC 

ATATGATAAT 

CTGGCAATGA 

AACCTCCACT 

AATTTTACAC 

CAATGACAGT 

TTCTACACGT 

TTGCAGAGTT 

ACTTTATATT 

CATCCAAAGC 

GCTCCTGCAG 

TGGCCAGTGA 

TACTTCCAAG 

TGATTTTAAA 

CACTCAATCC 

AATATTTGGA 

AAGTTACGTG 

TAAGAATGGC 

GAGGCCGAGA 

AAACCCCGTT 

TCCCAGCTAC 

AGTGAG CCGA 

AAAAAAAAAA 

CTAAATTCCT 

ACATTCC CAG 

TAGTCTGGAA 

ATTTTCCAAA 

AAACTGAAAT 

GATTGAATTC 

GGAATCAGAG 



ACCCTTTCTC 

AAATATACTA 

TATTTCATAG 

GCACTGTTCC 

TTTTAAAAAG 

GTTTCAACAA 

TTTGAAATCT 

CCAATGCTTA 

CTCTGGGAAG 

ACACTTTTTC 

GGTTTGTCCT 

TTTATCAAGA 

ATTACTCAAA 

GTAGGGGACC 

CCTAAGCTGT 

C CAG ATAATC 

CAAAATAGCT 

TTGAGGTTTA 

TGTGGCTAAT 

TTCTGAAAGG 

AAGTTAGTTA 

GAGTCGGTTA 

GGCAGTTTCA 

CTTAGGCCAG 

AAAGGAAACT 

CAATTTTAGG 

TATCTGCTGG 

TCTGTAAAAT 

AATAATTCAG 

TAGTTTATAG 

TGGCCTCTCT 

GTCCCAGGGA 

CTCCTGACTA 

ATCATGCAAA 

AGATGTATTT 

TCTGGGGCAG 

GTGCAGAGAG 

ATTTGCAGTA 

TCCTTTTTAT 

ATTTGAGTAA 

ATTTG CATAT 

TTCAAGGAGG 

CGGGCGGATC 

TCTACTAAAA 

TTGGGAGGCT 

GATTGCGCCA 

AAAAAGAATG 

GGCAAGATGC 

CACTGAGAGT 

AGAATTCTTA 

TTTGAAAACA 

ACTTCTAAGT 

TATTTCCATT 

TATTCAGTCA 



TCTAAACATG 

CAGATCTGTG 

TTTTG CTAGT 

TCAGAAACAT 

TGAAAAAAGA 

GGTATCTAAT 

GGTGTGTATT 

ATACTCACAT 

ACACAAGCAA 

TGACCTGTTC 

ATTGTAACCA 

CAGGGGAATT 

TCAGTCTCCT 

AGTGAGTCGG 

CCTCTTGTGC 

CAGTTTATCT 

CAAGCATTGA 

GAATCTTGTA 

TTGTTAGTCC 

GCTGTTATTG 

ATCCCAAACT 

GGTAAGATCT 

CTGTCCACTT 

GACACCACCA 

ATATATGATA 

GTTATCTTAT 

TAGTAAGAAT 

GGAGACAGTG 

ACCAACCACT 

TTTATACTAT 

AAAAGAAGGT 

ACTCACTTAA 

ACCTTTCTCC 

CAGTGGTCAT 

GAATAGAGTG 

ATTTATCCTG 

GGGCCCTACA 

TTAAGGTACT 

TAAAATCACC 

GAGAAAGTTT 

GTGATCATGT 

CCGGGCGCGG 

ACGAGGTCAG 

ATACAAAAAA 

GAGGCAGGAG 

CTGCACTCCA 

GCTTCAAGGA 

AGGTCTAGAT 

GAGTGTCTAG 

CAAAGTTTAC 

ATCCTAAAAA 

TATGAAGAAA 

ACCTATAGAC 

AAACTACAGG 
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89101 


AAGGATTTTT 


89161 


TTCCTGACAG 
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TCAGACTCAT 
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CTGCCTGTTC 
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AATGTACTTT 
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C CGGTAGCG A 
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TATGAAAGCA 


90301 


TTTTATACAA 


90361 


CTTTATCCAA 


90421 


GAAACCGTGT 


90481 


AGGACTATAA 


90541 


AGTAGCTTTT 


90601 


CCCTAAAAAG 


90661 


TAAGCGCAGC 



TGGTAGTGTC 
TTGTTTTGTC 
TTTACAGTCT 
AGTTGTAAAA 
TATATTTAGA 
ACATGGGTAA 
GACCGCATAA 
TAGTGGACTT 
CTCCAGCAGC 
GTAACTACAC 
TGTCACGTGG 
TTTACACATA 
CTCAGCTTGT 
ACAAAACTTG 
TGGGTTACAA 
GGAAGAAACG 
TTGGGCATGA 
CCCACCAGGT 
AGGTCGGTTT 
ATCAGCAGCT 
CGGTAGCGAT 
GTAGCAAGCT 
CGAG CCATTT 
AAATATAGTG 
ATCATTGGCT 
GTTGCCCCAC 
AAATTCTAGT 
AAAATGTAAA 
TCTCGCAAGT 
GTCGGGAAAT 
CAGCAAGTTT 
TCAAAATGTC 
CTAAAGGAAG 
GGTGGCTCAC 
CCAGGAGTTC 
AAACTAGCTG 
GAGAACCGCG 
CCAGCCTGGG 
AATACCCTAA 
ATGCCGTTTC 
TCGTGGCTCT 
GCAATAGTGA 
AGGTGGCGGG 
ATCTCGGCGG 
TTGCCCTTTC 
CGAACAAGTT 
GCGGAAAACT 
ACTGCAAGGC 
TAGAAAAAGA 
TTCTTTTGTC 
ATACATGGGC 
CTATTCTGTT 
GGTTCTAAGA 
CGCAAGGAGA 



ATATTCAGAA 
AGCTTTTATA 
GCAGTACTTT 
GGTTTGATCC 
AAATGGATGA 
AAGAGCTTTG 
TCTTTGGATG 
TTGGCAGTGT 
CCAAAACCAA 
ACACATTGTC 
ATTAAAAGGA 
TACATGCTAA 
TTTCTCGCAG 
TGTGTAGTTA 
CTGTTTTTAA 
GGCAGACTTA 
TGGTGACGCG 
AGGCCTCACA 
TGAAGTCCTG 
CGGTGGACTT 
GAGGTTTCTT 
GCTTGCGCGG 
GCAATGAGAG 
AGAAACATTC 
GAAGAGTGAC 
CGCCCATCCT 
TCATCCAGTC 
TTCCGATTCA 
TATCAATGCT 
AACGCTTATA 
CCTTGCCCTT 
TTATTTTGGT 
GTGTTATTTT 
GCCTGTAATC 
AAGACAACCC 
GGCGCGGTAG 
TGAAGCGGCG 
TGACAGAACT 
CAGAAGCAAG 
AAGTGTAAGC 
GTTATTTTGG 
CGTTGCCCAG 
GGATGATGCT 
CCAGGTATTG 
GAAAAAGATG 
TTTGCTTTAG 
GTGAAAGACA 
TGCAATAGGA 
TAACATAAAT 
CAATCAGAAG 
TCTGAACTGT 
TAGGAATAGC 
AGGCTATCAC 
GCTATTCTAT 



GTTAATAAAA 
AAATTGGAAT 
TGCATTTTTA 
CCAGAAAACC 
AATCAGCATT 
CAGTTGCCAC 
GTCATACGCA 
GTACTGAGGC 
TACTGAATTT 
TTTATGATAG 
GTGACGGTGG 
AAAGACCCCT 
TAAAACCGGT 
TCACCTTTAT 
GTGAAATTGT 
TGCCCTTTCC 
TTTAGCGTGA 
AGCCTCCTGC 
GGCGATTTCT 
CTGGTAGCGA 
CACGCCACCG 
AGCTTTGCCG 
CACACACAAA 
TGATTGGTCC 
CAGACTGATT 
GTCCTTTTCG 
CCAAAGAACA 
GTAAGTTTGA 
GGTGAACACT 
TTCAGAGAAT 
TGTTTTCTAA 
TGGCCTTAAG 
CTCGAAACTT 
CCAGCATTTT 
TGGCTAAAAT 
CAGACGCCTG 
GGGTGGAGGT 
AGACTGTCTC 
TTATCATCCT 
TACGTTTTCT 
CAACAGGACG 
CTGCTTGTTG 
GCGGGTCTTG 
T AAGT AC AC T 
ACGGACTCTG 
CTCCATTTTC 
AGCAAGCTGG 
AGCTATCCTA 
TCCATATTTG 
TGAGGAATCT 
TCTCTGTACT 
AATG C CTGAA 
TAAGGCGCAG 
CTATGTGTAC 



TATGCTATTT 
TTGATTTTAT 
ATTTTACATT 
TTGATCTACC 
TGAATATTTT 
CCTTCATTCT 
AGTCTTGTGT 
CAGTTTCTTC 
TGGGGTCAGC 
CTTTAATAAT 
TGTCCCCAGG 
AGGAATTTTT 
TGAAAAGGCC 
ATCTCCTGGA 
GAGTGGCTCT 
CCACGGATGC 
ATAGCGCACA 
AGCGCCATCA 
CGCACCAGGC 
CGGATTTCGC 
GTGGCCGGAG 
CCGGTAGACT 
AGTGTAGTGA 
TGTAATATTT 
GGTTCATTAC 
TTTCAGTTAT 
GAGTGTATAA 
GTGGGACTTG 
CACTAAACCA 
GAGATTCCAT 
GTCCAAGTCA 
TTTCACTTTG 
AACTTTTTAA 
GGGAGGGCGA 
GGTGAAACCC 
TAATCCCAAG 
TGCAGTAAGC 
AAAACAAACC 
TTCTTGTGTA 
GATTTGAGTG 
GCCTGAATAT 
ACCTCCTCGT 
TCACGTATGG 
GGCGCACCGG 
CCCTATTGGG 
CACGTCCGCA 
AATGGCGCCT 
TTGGTCAATT 
CATAAACCCC 
TAAACCGTCA 
ACTCTGTAGT 
CCCTCTAAGT 
AAGAAGGATG 
AAGGTTCTGA 



TCTGAATTTT 
TTTCCCATTA 
ATAGTTTTTA 
CCATCAGTTA 
TAAATATTTA 
CAAATTCCCT 
ACTTGTTACA 
CACCTGAGCT 
TATTGTTTTT 
ACTGCCATCA 
AGCCTTTCAA 
TAACAAGGGC 
TGATAGACTT 
AACTAACATA 
GAAAAGAGCC 
GACGTGCCAG 
GATTGGTGTC 
CCGCAGAGCT 
GCTGGAACGG 
GCAAGGCCAC 
CGCTCTTACG 
TGCGAGCTGT 
ACTGAGAGCA 
CAAAAGTCCC 
TAGACAATCT 
CTGCAGCGAC 
CAAGGTATCT 
AAATTCTGCA 
CCAGAAACGT 
GCTATTTTGT 
CATTCCCACC 
TATACTCTAA 
CACCATTAGG 
GATGGGACGA 
CGTCTCGCAT 
TACACAGGAG 
CGATATCGCG 
AATCCAAACG 
ACTATGGACG 
TTTACTTGAC 
TGGACAGGAC 
CGTTTCGGAT 
CGCTGCCCAC 
CTCCGACCGG 
AACTGCAAGC 
AATAGCGACC 
GAACAAATCC 
ATGTTTGGTG 
ACCCCTCAGT 
TTTGAATCTC 
GGAGAGTGTT 
CTGCTCCAGC 
GTAAGAAGCG 
AGCAGGTCCA 
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CCCCGACACC 
CTTCGAGCGC 
CACCTCCAGG 
TGCTGTGTCC 
GTAAGCACTT 
TATAACCACA 
CTGGAATGTT 
GTGCCTCTTG 
GGAGATGGAG 
AATTAGCTGG 
AGAATGGCGT 
. CAGCCTGGGT 
ATATGAAGTT 
CCTGCCTTCT 
GGTTTCTGTA 
CATCCAGACC 
GCAGTGCAAA 
ATAGACGGAG 
ACTTGTATAT 
AGTTACTATA 
GTTCTCCAGA 
TTTATTTGTT 
GCAGCCCAGA 
TTCAAGTAAC 
TATTTTTTAT 
CATAATTGAT 
TGTCTCTGCT 
CCCCCTCAGT 
ATGCATTTCA 
TTTCCGAAGG 
TGCTAATCTT 
CAATGGCGCG 
CTCGCCCTCC 
ATTTTTAGTA 
GTGATCCGCC 
GACCAATCTG 
CAGAATTTTC 
TTGGTG AC C A 
TCTTCTATAA 
TTCTAATGAT 
TCACTGACCT 
TTCTCTACAC 
ATATACATAT 
CTCTCACCAA 
TAGTCTGTTC 
CAATTATTGT 
CTCATGATTC 
TAGCTCCATG 
TAAGATTTTT 
TGCAATGGTG 
GCCTCAGCCT 
TATTTTTATT 
CAGGTGATCC 
CCCGGCCACA 



GGCATCTCAT 

ATCGCGGGCG 

GAGATTCAGA 

GAGGGCACTA 

CCAAACCCAA 

ATTTCTTAAG 

AGCGAAGACA 

CCTTTAATCC 

ACCATCCTGC 

GCGTGATGGT 

GAACGCGGGA 

GACAGAGCGA 

TTGAAGCAGA 

TCCTTTGTTA 

CTATAGTCCC 

CCAAGAGAGG 

GTAAATGCAA 

CAGGACATTC 

ATGGGGAGAT 

TTTTGCAAGA 

TATAGGGATA 

CCCTTAACCG 

AAGTCTCAGC 

TCTGACACTT 

TTTTGAAATA 

AAGCCAAAAC 

GATACTCGGC 

TTATTACCAT 

CAAAACTTGC 

GTCCCATGTA 

TTTTTTTGTT 

ATCTCGGCTC 

CGAGTAGCTG 

GAGACAGGGT 

CGCCTCGTCC 

TCTTTTTGTA 

TTTTCCCCTA 

ATCTTACAGA 

GTGAGATTGT 

TATTTTCATT 

TCGCTTTTTA 

ACAAG ATTG C 

TTTGATTTTT 

CAGGGTGTTT 

AAATTGCCGA 

TTGAGACTGC 

TTGCCCATTT 

TATTAAAAGA 

CGATCTCGGC 
CCCCAGTAAT 
AGAGATGAGG 
ACCCGCCTCG 
TTTCTAAATT 



CCAAGGCCAT 

AGGCTTCTCG 

CGGCTGTGCG 

AGGCAGTTAC 

AGGCTCTTTT 

GTGGTGCTGC 

AGTTTTAGAG 

CGGCAATTTG 

TTAACACGAT 

GGGCGCCTGT 

GGCGGAGCTT 

GACTCCGTCT 

AATTATTTTG 

CAGAACTCCA 

TTCTGTGGTG 

GTTCTTGGAT 

GTTTACTAAG 

CCGAAAGTAA 

GTGCTCTGCT 

ATCAACATTA 

TCTGGACACT 

TAAACATCTA 

CTCATTTTCC 

TTCTTCTCTT 

AGAAATCAAG 

AAAAACCTAG 

TGATCGTTAA 

TAGATCATAT 

CATAAAAATT 

ATATAAAACT 

TTTTGAGACT 

ACTGCAACCT 

GGACCACAGA 

TTCACCGTGT 

TGCCAAAGTG 

GAGGGGCCTC 

CAATATAAAC 

AATTTTATCT 

ATTTCACTTT 

ACTGCATTTC 

AAAATTTAAA 

TGTAAGGGCA 

AATACATGTT 

TTTCCTGACT 

CATGAACAAT 

ACATTTTGAT 

TCTTTTGGGA 

TTATTAAGTT 

TTTTTGAGAC 

TCACCGCAAC 

TGGGACTACT 

TTTCTCCATG 

GCCTCCCAAA 

CTTTATAAGT 



GGGGATCATG 

CCTGGCTCAC 

CCTGCTGCTG 

CAAGTACACT 

CAGAGCCACC 

TGCTATTCTG 

CCAAGGTTAA 

GGAGGCCGAG 

GAAACCCCGT 

AGTCCCAGCT 

GCAGTGAGCC 

CAAAAAAAAA 

TCGTATGTTC 

ACACTTACCC 

GCCAGAAATA 

CCCGCGCAAG 

AAAGTAAAGT 

GAGGAGGAAG 

ACAAGTTTGT 

TTATCTTTAA 

CCTAAGTCTG 

GAAGCTAGGA 

TAGCCCTCAC 

TTTTTCTTCT 

AATACTTGAT 

GTCTTCTAAC 

TAGGTAATTA 

GCCTACTGTC 

CACAGGTTTC 

TATATTAAAT 

GAGCCTTGCT 

CCGCTTCCCA 

TACGTGCCAC 

TGGCCAGGAT 

CTCGGATTAC 

AAGCATGAAC 

ATTAATTGTA 

TGTGCAAGTC 

TCTAGTATCC 

ATTGTAGGGA 

CCATGTTACC 

AAAATAGAGA 

ACCAAGTTGC 

TCCACAAATG 

TAAATCTCAT 

AATAACATTT 

TGTTGCCTTA 

TGAGGGCTTA 

GGAGTTTCAC 

CTCCGCCTCC 

GGCAAGCGCC 

TTGGTCAGAC 

GTGCTGGGAT 

ATAAATTCAT 



AATTCCTTCG 

TACAATAAGC 

CCTGGGGAGC 

AG CTCTAAAT 

TACTTTGTCA 

TTTCAGTTCT 

CTTGGACGGG 

GCGGGCGGAT 

CTCTACTAAA 

ACTCGGGAGG 

GAGATCGCGC 

AAAAAAAAAA 

TTTCATAAAT 

AAAGGTAGCT 

TGTTACAGGA 

AAAGAGTTCA 

GGTGAAACGA 

GCATCCACCC 

GATAAAGGAT 

ACAAAATTAA 

AGTCTGTTTA 

ATGACTGACT 

TCAAAATGGA 

TTTTTCCTTC 

GTTTCATCTA 

TCAAAACTAG 

ACAAACAAGC 

AATCATATTA 

CCGCTTCCCT 

ACATTTGTAT 

CTGTCACCCA 

GGTTCAAGCG 

CATGCCCCGC 

GTTCTCAATC 

AG ACGTG AG C 

TTACTGATGG 

ATGTTATCAT 

TATGCAAACC 

TTTTAAATTA 

AGTAGATAAT 

ATGAAAATGC 

TAGGAATCAT 

CTCCTGAAGG 

CTCTTGAACA 

TGTTGTTTTT 

CTTCTATTAT 

TGTACATTAT 

TGATATGTCA 

ACTTGTTGCC 

AGGGTTCAAG 

ACCACGCCTG 

TGGTCTCGAA 

TACAGGTATG 

TCAATCTTCA 



TCAACGACAT 

GCTCGACCAT 

TGGCTAAGCA 

AAGTGCTTAT 

C AAGG AG AG C 

AGAGGATCAA 

GCCGTGCGCG 

CACGAGGTCA 

AATACAAAAT 

CTGAGGCAGG 

CATGG CACTC 

AATTAAAAAA 

TTTTTGCCTG 

GTTGGGTCAG 

AAGAGGTCCC 

GGGTGAGTCC 

CAACTACTCC 

TAGGTACAAT 

TAATTTTCTT 

GAATGCCTTT 

GTAAACATTA 

TTCTGGGAAT 

GTTACTCTGG 

CTTTATTTTT 

AAACAATACC 

GATGTTTTGC 

CTTGCTATGT 

ATCCACAACT 

CGAGTTTTCA 

GCTTTTCTCT 

GGCTGGAGTG 

ATTCTACTGC 

TAATTTTTGT 

TCCTTACCTC 

CACTGCACCC 

GTGAGAAAAA 

TCAGGACATT 

AATATGTAAA 

ATAAAAGAGA 

TGCCCTTTAT 

TTTTCAGTAT 

GCATCCATTG 

TCTGTTTACA 

GTGGGTGTGT 

ATTTTTAAGA 

GGTTTGATTA 

TTTAAATAGA 

GTTACATTTC 

CAGGCTGGAG 

CAATTCTCCT 

GCTAATTTTG 

CTGCCGACCT 

AGCCACTGGG 

CCAAAACTCA 
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93 961 ATGAAGTGTG AGTACTATTA TTATCATTGT TTTACAGATC AAAACAAGTA ATACAGTCAC 

94 021 TTACTGAGTT CTATACACCT GGTAATTTTT TTGTTTCGTT GTTCTATCAA TTATTGGGGA 
94 081 AGGGGTGTTG AAATCTCTAC CTTTAAATCA TGTATGTGTC TATTTCTCCT TTCGGTTCTA 
94141 TCAGGTTTTG CTACACATAT TTTGCAGTTC TGTTATTTGG TGCATATACA TTTAGAATTG 
94201 CTTGTTTTTC GTATTGGATT GACCCTGTTA TCATTATGTA ATATCCCTGT CTGTTCCTAG 
94 261 TAATTTTCTT TGCTCTGAAA TATACTTATC TGATATATCA TCCAAAAGAC CACCAGGATG 
94 321 GCTAAAGAGT AGAAAGGAGA GATTTACTGG CAATACTAAT TTGCAAGCCA GGAAGAGATG 
943 81 GTCCCAGAAC CTGCCAAAAT TACTCTCTCT TTGGGGAGAA GGAG C AGGTT GGTTATTTTT 
94441 ATGCCTCATA GGCTATATAT TACACAATAG AGTCATACAT ATTTAGCACG TTTGGGGGGA 
94501 CAGCTATATA TATTATGAGG GGTGCCAAGT GCATTCACAA TGGATAAACA CGTGTAATAT 
94 561 ACCTCCCATG TTCACTTCGA GGTTAAATTT TGGTTAAAAT GAGGTAGAAT TTAGGTCTTT 
94 621 ACATCACAAG GTGAACTATA GGAACAAAGT TTACGTGCTG CCTCTAGCAG CTGGCTGAAA 
94 681 ATGGCTTAAG GTCTACAATT ACGTGTAAGA ATAGAATGTG TGTCAAGGCG GTCCTCTGTC 
94 741 CAATCAGAGT TGTAGTGGAC TGGACTGTAA ATCAGAGTTA GGAGGGCTTC TGATAGCTCC 
94801 TATAGTTAAG GAATTTAGCA AGTGTGAGTT TTTTGGTAGT CTTTGGAATT TAGGAATTTG 
94 861 CCATGCCAGC CAAGCCATGA ATGCTCTACC AGTAGGTAAC TTTGTTTGCT TAATCTTAGA 
94 921 GTCTGTCTTA GTTGGTATAG GGGCATCTAT TTTGGTCTTT CAGATCCCAG ATATTATTAA 

94 981 TACAGATACT CTTGCAGTTT TGGGCTGATG TTTATATGGC TTATCTTTTT TGCAGCCTTT 
95041 AATTTCAACC TGCGTTATGT TTATATTTGA AGTGAGATTC TTGCAGACAG TGTACAGTTG 
95101 TTGTTTTTTT TTTTTTGAGA TGGAATTTCA CTCTTGTTGT CCAGGCTGGG GTGCAGTGGC 
95161 ACAGTCTCAG CTCACTG CAA CCTCCGCCTC CTGGGTTCAA GGGATTCTCC TGCCTCAGCC 
95221 TCTTGAGCAG CTGGGATTGC AGCCATGCGC CACCACACCC GGCTAATTTT TGTATTTTTA 
952 81 GTAGAGACAG GATTCACCAT GTTGCCCAGG CTGGTCTCGA ACTCCTGACC TCAAGTGATC 
95341 CGCCAGCCTC GGCCTACCAA AGTGCTGGGA TTACAGGTGT GAGACCTCGC GCCCAGCCAA 
95401 ACTGTTTTTT TATGGGTGTA TTTATACCAC ACACATTTAA TGCAATTATT GATATCTTAG 
95461 GGCTTAAGTT CATGAAGGGT AGTGTGGGAA CCATAGTCTC TTGGCCCACT AAATGTTTGC 
95521 CAGAAATCAC TGACAAGGCA GATTGATTAA TAGGTGAAAA GGCATTTTAC CTATTGTTTA 
955 81 ACGTGTCTAT GTGGGAGCAT TCAGAATTAA TTACCTAACT TCCCAATGAG TTATAGATGC 
95641 TTATATACCA TTTTTAGATC ACAGAAAGAA TTGGGGCTTA GATTCTGGTA AAACAGGTTA 
95701 TGGGAGGCAA AAGAGGTTTG GCTTGCAAAG GTGGCCTTGT TAGGTAGGTG AAGCCTCCCT 
95761 CAGAAAGAAC AGATGGTAAA TGTTTCTTTT ATGATTTTTA AGTGTCAGAC TCTCAGTCTC 
95821 TCCTGGATCT GGGGAAAGGT ATAGAAAGGT GAGGAGGCAT GGCTGCATTA ATGGAGATTC 
9 5 881 TCTACAGATG TAAAATTTTT CCCATTTAAG GCAGCTTTGC AAGCCCATTT CTGCCTGCTG 

95 941 GCCAAGCAGC AGCCATTTCA AAATATGTCA AAGAAATATA TTTTGGGGTA AAATATTTTG 

96 001 ATTTCCTTTA GACTGGTGGC CTTATAAGAA AAGGAAGAGA CACCTGAGCT GACACACATA 
96061 CCCTTGCTCT CTCAACATGT TATGATGCAG TAAGAAGGCC CTCACCAGAT ACTAATTCCA 
96121 TGCCCTTAGC TTCCCAGGTT CTAGAACAGT AGGAAATAAA TTTCTTTTCT TTAAAAGTTA 
96181 GCCAGTCTGT GGTATTCTGT TATAGTATCA CAAAATGGAC TAAGTAACTA TATTATGATC 
96241 ATCTTACATG ACTGATCCCT CCTACATCAT ACACATACAC AGGCCACATT TGGAACATTG 
963 01 TTAGAGGTTC CTCTGCCCAG TACAAATGTA CTACAAATTA TATATGTATT TTTAAATTTT 
96361 TGAGTATCTT CAATAGTATA TTTTCGTTAA CTTTTGTAGT CAAAATGTCA TTATAACATG 
96421 TATTCAATAT GCATAATTAT TAGTCAGATG TTTTACATTC TTTCTTCATA CTAAGTGATA 
96481 TGGTTTGGAT ATTTGTCCCC TCTAAATCTC ATGTTGAAAT GTAATCTCCA ATGTTGGAAG 
96541 TGAAGCCTGG TGAAAGGTTT TTGGATCGTG AGGGTGAACC CCTCATGAAG CGCACTCTTC 
96601 AGGGTAATCA ATGGGTTCTC ACTTTGAGTT CACAAGAGAT CTGGTTCTTT AAAAGAGTGT 
96661 GACACCTCCC CCATCTCTCT CGCTCAGCTC TCACCATATG ATATGCCTAC TCCCTCTTCA 
96 721 CCTTCCACCA TGATTGGAAG TTTCCTGAGG ACTTGCCAGT AG CAG ATGCC TGCACCACAC 
96781 CTCCTGTACA GCCTGCACAA CCGTGAGCCA AAAAAAATTA CTTTTCTTTA TAAATTAGTC 
96 841 AGTTTCAGGG ATTCCCTTAT AGTAATGCAA GAACGAACTA ACACACTAAG TCTATTTCAT 
96901 ATTTACAGAA TAGCTCAATC TGAAGTACCC TTTTTCAACT TCACAGTAGC TACTTGTAGC 
96 961 TAGTGGGCAC TGATTTGGAG CGTGTTCAAG GGTGAATTGT ATTATGCAAT TAACAGATTT 
97021 TTTTTATTGT TTTCGCAAAC CACGAGGCAT AGATTGTCTT ACTTTCTCTG CTCCTGGTGT 
97081 TGGAGTTGTT ATTGGGAAAC AACTTATTTT CCTCTTATAT TTATATGGAA TAAATAACCC 
97141 CCAATATTTC CCTCCCCAAT ATCTGCCTTT TGTATGTTTT TTGAAGGCAA GTGCCTAGAA 
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TTTACTGTTT 
ATGCCAGGCG 
CTCATCCATG 
TCTAGTCTGA 
ATATCCAAGG 
TTTCTGAACC 
GTATTTATCC 
ATTCTCCTGA 
TTTTTCTCCA 
GCACCTCCCA 
CTATGCTCCC 
TGAACTAATA 
ATTGATCAAT 
ATATGCAGAA 
ATAATCTCTT 
CCTTTTCAAA 
ATCATTATAA 
AGAAGACTTG 
TTTCTCTGAA 
TTTTTTTTTT 
GGCTCACTGC 
AGCTGGGATT 
CTGGGGTTTC 
GCCTTGGCCT 
GATTTTTTTA 
ACAGATAGAA 
ACTCCATCTG 
ATCTGTCTTG 
CCCGGCCAAG 
ACACAGATTA 
TTAGAATTAA 
GATAAACAGC 
GGCTTGTCTG 
GGACAAGACT 
GAGTAATACT 
CTACCTTGAG 
ACAGGAAGGC 
TTCTGTGTTA 
AGTGTTGTGA 
TCTTTGAACT 
AATCAAAAAT 
TCCCTGGGGA 
GTTAACACCA 
ATTTTTCTCC 
TATGAATCAA 
CTTCCCTCCC 
GCCCACGCTG 
AAGCGATCCT 
CCGGCTTTTT 
GTTTCTCCAT 
AGTGCTGGTA 
TTTAGAAATT 
ACGAGCGGCT 
TTTTGACAAA 



TTGAAGCACT 
CTTGTTGGTT 
GCTCAGTGGA 
GTTTTTGAAG 
CTCTTTCCAA 
ACGGTTCCTG 
TTCCTACTTG 
TGAAACTTTC 
CAGCACTCAT 
CTACAAGACA 
TGC AC CTAGA 
ATGCTGGATA 
CTTCTTTTCC 
CGTGCACTGC 
CAGGGCACTA 
GAAAATGAGC 
TTTTGAAAAG 
GGAGAAGGCA 
TCAAATCCAT 
TGAAATGGAG 
AACCTCTGTC 
ACAGGCTCCC 
ACCATGTTGG 
CCCAAAGTGT 
CACTCATGTT 
GTAGTAGATA 
CTCCTATCTC 
ATTTTAGGTT 
GAAAAACTTC 
ACTGGAGAAA 
GACTGAAAGA 
TGTATAGGGT 
TCAAGATTCT 
CTCTTTTAGA 
TTTAGGTTTT 
GAGGAATTCT 
AGAAGGTGGT 
TGGAATGTTT 
AAAAGTTCAG 
GAGGGCACCT 
TTGAAAATTA 
ATCTCATCAA 
TCTAAACAGA 
AAAATCATAT 
GAGAGCTTAT 
ACTCCCCCTC 
GAGTGTGGTG 
CCCACCTCAG 
TTTTT^ZTT^TT 
GTTGTCCACG 
TTACGGGCAT 
GGTCGGAGTC 
GAAAGTCAAA 
ATCCTAATTC 



TACTGAAAGG 
TGCTTAATTC 
GTATAGATTA 
CTACCCTTAA 
AATGGTCTAC 
ACATTTTCTG 
GCTGGCTTCT 
CATCCTTATT 
CACTTATCTC 
AGTAGCACCG 
ACACTCTCTG 
TACATCTCCC 
ATGTG CTTTT 
TATTTAATCT 
TCTGAGATAA 
CAGTGATTAC 
GGAAGTTGAA 
AAAAACAAAC 
AGTTCTGTGA 
TTTCGCTCTT 
TCCAGGGTTC 
ACAACCACGC 
CCAGGCTGGT 
TGGGATTACA 
TCTTTTTCCT 
CCTCAGAAAT 
ATGGAATATA 
CCTCAACAGG 
CCCTTTGCCC 
AGGCATATAT 
TACAGGGGAA 
ACGATCTAAT 
TCTTGACCTC 
ATGGGGGGTC 
ATGGCTGGTT 
GGTTTCTATG 
CAGTGAAACA 
GTTCTCTCAT 
GAAATGCAAC 
AGGAAACAGT 
AAAAAAAATT 
C C AG AG AAG A 
CTTTGTCACA 
ACTCTCCCCT 
AAGCTTCTAC 



GCTCTATGTG 
CTTCTCGAGT 
TCTCCCCCGT 
CTGGTCTCGA 
GAGCCACTGC 
CACTCCTTTC 
ATAACCAGAA 
GGCCAATTAT 



ATTGCCATCA 
AAGGTAACTT 
CTGATATTGT 
TCTTGGTTTC 
GATTTGTTTA 
GACTTCAAAC 
TCCTTGCCTT 
TCTATTCTTT 
TACATTTTCA 
TAAGGAAACA 
GCACTTAGCA 
TCATGAACTC 
GTATGATTTA 
TCATGTACGT 
CTTTTTAACA 
TGATGTTTAC 
TATTGTGAAG 
TAAAAATGAG 
CAGCGTTGGC 
GCCCAGGCTG 
AAGCGATTCT 
CCAGCTAATT 
TACGAACTCC 
GGCATCAGCC 
TCTGTCATCC 
TCCTGGAATA 
AAAGGAAAAA 
AG AG C C AG AC 
TCCCAAGGTT 
ATTTATTTCA 
ATTGCCCATT 
GCTAACAGAC 
TCAGTGCAGC 
TTATG AC CT A 
CTAGGGAAAA 
GCTAGACTTT 
CTTTTATAAT 
TTCCTGAAAG 
TCAAAAATGT 
AAATTCAAGG 
CAAAAAGGAA 
TTAACTGTAT 
GCTGTCACCT 
AAGTTGCCTA 
AGTTCACTGG 
GTCTTTGAGA 
AACTCACTGC 
AACTGGAACT 
TTCTTTTTTG 
ACGCCTGACC 
GCCCGATTTG 
CAAAAACATG 
CAAAACCTCC 
TATTAGTATT 



AGTTGTTTTG 
GGATGAGAAG 
GACTGGATGT 
AATTTTATCT 
GGAAGTTAGA 
ACATCCAGCA 
CAGGTCTGAA 
TTTCTTATCC 
TTATGTATTT 
GGTTGTCTGC 
GGTTTTCAGT 
TCTAAATCCT 
TTGCTCAAAA 
AAGTCCTCCC 
TCTCCATCAT 
GGCTATTGTT 
GGAAAGATAA 
CACTTTTAGT 
TTAGAAGCAG 
GAGTGCAGTG 
CCTGCTTCAG 
TTTTGTATTT 
TGTTCTCAAG 
ACCGTGCCCA 
TGTTTCAGTA 
ATTAATCCAC 
CACCAAGATT 
AATGGCTGTA 
TATGGAAAAT 
TCACAATTTT 
TTTATGCTTA 
TGAGTGGGGA 
ATTTCTTCCT 
CAGGCAAACA 
GGAGTTCTGG 
GGGGAGAATG 
CATAATCCCA 
ATTCCAGAGA 
GCCACTTTGT 
AAGGGCTTTC 
TTTAGTTGTT 
CACAGGAGAG 
ATTCTTTGAA 
CATCCCCCTT 
GATTTGGGGT 
CACAGTCTTC 
AACCTCCTCC 
ACAGGCGTGC 
GTTATTTTAC 
CGCCGTCCTC 
AAGGACCTCT 
AGTCACAATC 
ACTCATGCTT 
CAAGTCGAAG 



CTAATAGTAC 
AAGAGTTTTT 
ACTCCTGCTT 
AGCCCTGTAC 
ATAGCTGTAC 
TTTTATCGAA 
TTCAAATGAC 
CCTTTCTTTA 
ACCTTATTGT 
TTTTTCACTG 
AAATATATGC 
TCTAATTTAC 
TCTTTATTTT 
TTCTCTGAGT 
GAATCTTGTA 
GAGGGTGAAG 
CACTAGAGTC 
CTCCTGACAG 
ATTTTTTTTT 
GCACGATCTC 
CCTATGGAGT 
TTAGTGAAGA 
TGATCTGCCC 
GCCAGGAGCA 
TAAGCAGACC 
GTTCATCTGT 
TCCCTAGGCA 
ATAATATTGT 
TACTGGCAAA 
ACAGGAGATT 
GGTTCAACAA 
AGCCCCGCAA 
TCTGGTTATA 
AGGTAGGTTA 
TTTGTATGGC 
GGACTTACAG 
TTTTGAGTAT 
CTCCTCATTC 
TACGCTGATT 
GCTGAACTCT 
AAGATTCACT 
GAGACTGGTG 
AC AC C C ATTT 
CTTTCTCCCT 
ATTCG CTTTT 
TGGCTCTGTC 
TCTCGGGTTC 
ACTACCAAGC 
TGGAGACAGG 
GGCCTCCCAA 
TAAATATCTA 
CGGGAAAAGC 
AAAAAAGGTA 
GCTCGTCAAG 
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CCAGACTGGG 
GAAGGATGCA 
CACAGCCTAC 
TACATTTTCT 
AGGCGGTAAA 
TGTGGGCCGA 
CGCGCCGGTG 
GGGCAATGCG 
CATCCGCAAT 
CGTTTTGCCT 
CAAGGGAAAG 
CAGCCTAACA 
TGCTTTGGAT 
GGCACTTTTA 
TATGTGGGAG 
TCACACACAG 
ACTATCTTGA 
CATTTGCTGT 
TTGAGTATGT 
GGGGTGGAGT 
ACCTCGAACT 
CGAGAGCCGC 
CCCAGAAAAA 
TTCTCTGATT 
TGTGGTTGTA 
ATATAG CAGC 
CTCTCTTAAT 
TGAATTTTCT 
CTTTCCTCCA 
AAACAGACAG 
GCTGCTCGCA 
GGCACTGTGG 
AAGCTGCCGT 
TTCCAGAGCT 
TTTGAGGACA 
ATCCAGCTCG 
AGTCTTAAAA 
GATAATTTTT 
CTATGGTCTT 
TTTCTGACCT 
CCTCTAGCTT 
TGGTAAGTAG 
ATTGTGTCTT 
ATACATAGTC 
TAAATTCATT 
GGATCTAACT 
TCGACCAAGC 
AATAGTC TAT 
ATTCTTTTTT 
CCATTTTCTT 
GAAAATTTAG 
TTATATTTTG 
AACAACCACA 
TTTTTACAAT 



GATTGGGTCA 
GATTCTGATT 
CTCCAGTCAG 
TGTGGCGATT 
GCTCGCGCCA 
GTGCACCGCC 
TATCTCGCGG 
GCCCGCGACA 
GACGAGGAGC 
AATATTCAGG 
TGAAGAGTTA 
GCAAAGGCTC 
TATGCCGCCC 
GTAATTTGTC 
AAGTGC CATG 
CAGTTACTAC 
ATGGAAGTGT 
AGCAATTAAT 
ATTTCCCAAA 
CTCCCTCTCG 
CGGGCTCAAG 
CACGCCCGGC 
TACAATTTTA 
TCTTTTTTAT 
AATTTTAAGA 
TAAGGGGTTA 
CTGCAACCAG 
TGGGTCCAAT 
C AG ACGTCT C 
CTCGGAAATC 
AGAGCGCGCC 
CTCTGCGCGA 
TCCAGCGCCT 
CTGCGGTGAT 
CAAACCTTTG 
CTCGCCGCAT 
CCCAAAGGCT 
TGTTGTCTTA 
AAAGTTGATT 
TATTAAGGTG 
G CTATG ATT A 
TTAACTGGCG 
GCGAGTTCCA 
CCCTAGGTTT 
TGGCTGTTAA 
GGCTCTCATT 
ATAATTTAAT 
CTGGTTGCAT 
TCAGTACTTC 
TCTGTTGATC 
AGGGATGGGA 
TATTTTTTTA 
GAATGTCCAA 
AAATATTTTT 



AACATAAACC 
TCCCATTGGG 
TATAAATACT 
TTCCCTTATC 
AGGCTAAGAC 
TGCTCCGCAA 
CGGTGCTTGA 
ACAAGAAGAC 
TTAATAAACT 
CGGTGCTGCT 
ACGCTTCATG 
TTTTCAGAGC 
ATAAAGATGT 
CTGCAGAAAT 
CAGCACAAAA 
ATTTTAGAGG 
TAAAACCCGC 
GGCATACACA 
ATGAGCTTTT 
CCCAAGCTGC 
CGATCCTCTT 
TAAGAGCATT 
AATAAAGCG C 
ATTTTAACTA 
CTTCAGGAAA 
ACAAAATGAC 
GCACAGAGAT 
AGTTGGTGGT 
TGCAGGCAAG 
CACCGGCGGT 
GGCTACCGGC 
GATCCGCCGC 
GGTGCGAGAA 
GGCGCTGCAG 
CGCCATCCAT 
TCGCGGAGAA 
CTTTTCAGAG 
ACAGAACAAA 
AACAGAAATA 
CTATTTGGAG 
GCATTTGTTT 
CTTACTAGGC 
GTGTCTTCCT 
TCTCATATAT 
CATTAACCTG 
TTATCCATAG 
CCCTTATATA 
TGCTTTGTCT 
TTGCCTGTAG 
TTGCTTTTCG 
ATACTGTACG 
TC ATATAG CT 
CATTAAAACT 
TACACCTAAC 



TTACACCAGA 
TATTTGACAT 
TCTCTGCCTT 
AGAAGTAGTT 
TCGGTCTTCT 
AGGCAACTAC 
GTACCTGACC 
CCGCATCATC 
CTTGGGGCGT 
GCCTAAGAAA 
CACTGCTGTT 
CACCTACGAC 
TTTTGAGGTG 
TAGATCCATA 
CATGTTTACA 
AAGGAAATTA 
ATGCCCCACA 
ATTG AG AG CA 
TTCCAGTTTG 
AGTGCAGCGG 
GACAGCCTTC 
TTTCTAATTG 
ATATGCAAAT 
GAAACAATTG 
CTTTTCCAGT 
GTCAGAGTAG 
GGACCAATCC 
CTGACTCTAT 
CTTTTCTGTG 
AAAGCGCCAC 
GGCGTGAAAA 
TACCAAAAGT 
ATCGCCCAAG 
GAGGCTTGTG 
GCTAAGCGAG 
AGAGCGTAAA 
CCACCCACTT 
TTTCTAAGGA 
ACGGTTTGGT 
AGAAGCTGTG 
AAACAACTTT 
ATTTTTGCAA 
CAAAATGCTT 
TATATATATA 
AAATTTATTC 
CTAGCTACCC 
TGAATTTTTA 
CCTCTAGGAC 
TTATTAAAAT 
GTTTTGGAGG 
CAAACAAAAG 
TTTACATCAC 
ACTAATTCCA 
ATTCTTTCTT 



CGGAAGGATT 
TAGCCAATGG 
GCGTTCTAAT 
ATGTCTGGTC 
CGTGCAGGTT 
TCCGAGCGCG 
GCCGAGATCC 
CCGCGCCACC 
GTGACCATCG 
ACTGAGAGCC 
TTTCTGTCAG 
TTCCATTAAA 
TTTTTAATGG 
GAAACCTCAG 
GGGGTGATTC 
TACCCATGAG 
CAAGTTTGAA 
CACACATTAC 
GGGATGTTTT 
CGTGATAACA 
TGAGTAGCTG 
CCCACACTTC 
TTCCCTAATC 
GAGGTTTCCG 
ACAAGACTTG 
CTACGGTAAT 
AAGAAGGGCG 
AAAAGAAGAG 
GTTTTGCCAT 
GCAAGCAGCT 
AGCCTCACCG 
CGACCGAGTT 
ACTTCAAGAC 
AGGCCTACTT 
TGACTATTAT 
TGTAAAGTCA 
ATTCCAACGA 
CCCCCCCGGA 
CAGTCTTGCA 
TAAGTCCACT 
GTAAGAGTAA 
AGCTTTGAAA 
AGGAAGATTT 
TATATATATA 
TGGTGCAAAA 
ACTTTAAATC 
TATGTGTGGC 
T ATG C AC CAT 
CTAGAATTTA 
TTGGGGATTG 
TAATATTTAC 
ATTTTACAGA 
AAGACCTTGC 
GGCCTACATC 



ACATGCAAAT 
GAGAATTCCT 
GTAGTTTCAT 
GCGGCAAACA 
TGCAGTTTCC 
TCGGGGCTGG 
TGGAGCTGGC 
TGCAATTGGC 
CGCAGGGTGG 
ATCATAAGGC 
CAGACAAAAT 
TG AG CTGTTG 
CTTTGAGTGT 
GAATTCTAGG 
GCGTTAAGTT 
TGCATTCCTA 
TATGTCATAC 
CACTGAACAT 
GCTTTGTTTT 
GCTCACTGTA 
GGATTACAGG 
TTATGCGACA 
GTCTCCAATA 
CGTTGCTTTG 
TCCACAGTGG 
GGGCAGGAGC 
CGGGGATTTT 
TAGCTCTTTC 
GGCTCGTACT 
GGCTACCAAG 
TTACCGCCCG 
GCTGATTCGG 
CGATCTTCGC 
GGTAGGGCTC 
GCCCAAAGAC 
CTTTTTCATC 
AAGTAGCTGT 
AAGCATTAGA 
GTGTAGGTTA 
ATCATTCAGG 
GGGAAAAATC 
AGATTAGAAA 
TCTCAGCTCA 
TATATACTGT 
TGTGAGGCAG 
TGTCAGTCTG 
TTTGCTTGTA 
GACATGCCAC 
CAAGTTTTAA 
AGTACTGGAA 
TTTAAAATTT 
CTAACTTTAG 
CTCACATTCT 
TAGAATGTAA 
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ACTGATGTAC 
AAAGATTAAA 
GGCAGTTTTT 
AGTTTGTTTT 
ATTTGTTATC 
AGGTGACTAC 
GATGTGCTGG 
AAATGTGTTG 
CTGAGGATCA 
ACATTCTTTG 
GATATTTAAA 
TTTTGTATTT 
TTAGAGGAGT 
CATCCAAAAA 
AGCATTCAAG 
AATGTGAAAA 
AGATGAGATA 
AACAAACCTT 
ACAATTGCCT 
TCACTTTTTT 
TTCATTCTGC 
AGAGTAGAGG 
TGTTCCCAGT 
TGAACTGCAC 
AATGCCCTGT 
TCCTGACTCA 
ACAACCCCCT 
CCATCTCCCT 
GCCTTGTTGC 
TTTAACAGGG 
TACAGCTAAT 
CTAAGATTCT 
CAGTAGCCCT 
ACAACAGTGA 
CACGAATGCT 
TTTTAACATG 
AGGCTGTGAT 
AGTGTTGGGA 
TTTTTTTAAG 
CAGTCTCTTC 
ACCCGATTTA 
GTACAGTGTG 
CAAATTAAAT 
GGTTTTACGT 
ACTTATATAA 
AGGCTAGAAT 
AGAATTCTCT 
GTGAGAACAT 
AAGAAAACAT 
TCAATATGTC 
AAGCCCTTAA 
TAGTAGTGTC 
CTCCCTCGCA 
CGCCTCCTTA 



CATACTAAAA 
TTTGAATTGC 
GGTTTATAAA 
ATGATTCGTT 
CACAGACAGT 
TCACAAGCTC 
ATCTGATGTA 
GAGGTACTGG 
GATGGACAGG 
CCCCCTCCTC 
AATTAACGAA 
GTGAGTAGTG 
AATTAAGGAG 
TAGACCACTG 
AAGGGAATTC 
ATGCTCTGCT 
ATTCTGCACA 
GTTAAAATAA 
CTCTTTAACT 
ATGAGGGTTC 
TAATCTGTTT 
TAAAATTTTG 
TGTTCAGTTT 
GTATGCCTCT 
TCCTGCCTTA 
AAAGCTCCCC 
TTGACTGTAA 
TCGCTGACTC 
TCACACAAAC 
TTTTTCCTGC 
G CTGTTG AAG 
CAGAGAAAGA 
TGCAGTTTTT 
GAAGCGTGTA 
GCTATTGTAT 
TCACATATGA 
TTTAAAAAAA 
GTAAAAACAC 
TAGTAACATC 
TCATGCCTCG 
CAGATGAAGG 
AAGGATAGCA 
TAATGTAAAA 
GACACAGGTG 
TGAATTGGAC 
ATATAACTGT 
TGGCCTCAGC 
CTTTCATATG 
AAGGCCAGAG 
TTCAAGCACT 
TATAAGCCCT 
GTAAACGGGA 
GTCCTTAGGT 
TCCTCGCTCC 



TCGCCTGACC 
ATCGTTTACT 
GTAATATTTG 
TTTCTTGACT 
ATAGATATGT 
TGGGCCCAGC 
TAGCGCTTGA 
GTCTGACGAA 
GGGTGGTAGC 
AACAGAACAA 
TGGATGAAAT 
CAAACATTTT 
AGATTGGAGA 
TGGGATCAGG 
TTCTAAACTT 
TGTACCAGGT 
AACACAGCAG 
TCATATATTC 
TAATGTGAAA 
TCCTGTCCCA 
TATGGCAAAT 
CCTCCCTACA 
GTCAGGCCTC 
AGATGGCCTG 
ACTGATGACA 
CACTGAGCAC 
TTTTCCACTA 
TTTTCGGACT 
CCTGTTTGAT 
CCAGTCACAA 
TCTAAAATCA 
AGTCAAGTTT 
CCAATAGAAG 
TGGAGAGTTG 
TGCACCTTGG 
AAAGCTAAAC 
CAATCCTTAC 
GAAAATGAGA 
TAAAATTAAA 
TTCACATTAG 
C ACGGTTG C A 
AAACTCCACT 
TGGATTAACA 
CTCTCATAAG 
AATTAGTAAA 
GTAGAGAAGC 
CTCCTATCCT 
AGAATTTCAC 
TGATCTTTTC 
TGAAAGACTT 
TATTAAAATT 
GGGAAAAACT 
CACTGCCCCT 
CGCTTTCAGT 



AACTGTCAAC 
TAAATTCATT 
CATTTTAAAA 
CTTATACAAG 
T AG AG ATGC C 
TGAAGGTCAA 
CTTTTTATAT 
TAGCATAAAA 
TCAGTCCAGC 
GGATTCTGCT 
TCTCATTTGT 
AATATTATAT 
CAAAAAGGGG 
ATTCTTTTGA 
TTCTTTCTGA 
GAAAAGACAT 
GGAGTCATAG 
CTTTAATCTC 
GCATTTAGCT 
TAAAATTTAC 
GAATTATCAG 
AGATAGAGAT 
TGAG CCGAAG 
AAGTAACTGA 
TTACCTTGTG 
CTTGTGACCC 
TCTACCCAAA 
CAGCCCGCCT 
GGTCTCTTCA 
CAAAGTGATG 
GTTTTGGTTT 
GGGGTGCATT 
TGATTTAAGA 
AACTACACTC 
AAAAGAGAAC 
GGAATCTGTC 
TAATACATAC 
GTTCAGGACA 
CCATATTATG 
CTAATTAAAA 
ATGAGCTATC 
CCCATCCTCT 
GGAGAAAGGT 
GTAATGAAAG 
ATGTAAAAAT 
GCCCAGCAAG 
TGAGAAGAAT 
CTACTGCTTC 
ACGCCTGCTC 
AAAAAGTTTA 
CTCAGTCGAG 
AAAGGGATTA 
CGAGGGGCGG 
TCTCAATAAG 



AACAACAAAT 
TGTGTTCCAG 
ATTATGAAAA 
CGACTCTAAC 
AATGGACTTG 
GTATTTTTTT 
TTTCTTTATC 
GAATAAAGTT 
TATTTTCCAC 
GTAACTCTTC 
GAAAGAAAAT 
TAAGAATCTA 
GTGTTGTTTG 
GCTAAAGGCA 
AAACAGGAGA 
ATTCTTCAGC 
CCG AG AG AC T 
CTCATATGGT 
TTTGCCATTT 
ATTAAATACA 
GTCCAGCTGG 
TGTGTGCATT 
CTAAGCCATC 
AGAAACACAA 
AAATTCCTTC 
CCACCCCTGC 
TCTTATAAAA 
GCACCCAGGT 
CACGGACGCG 
TTATGCTGCA 
GTTAGATTTG 
TTTCAGACTT 
ATGTTTTCAG 
CAGACTTGGC 
AAAGGAATAT 
AACACCTTGT 
ATAGTTGCTG 
ATATCCCAAC 
TAATATTTAT 
GTCCCCTGAG 
ACCCTCTTCT 
TAGGGCTCTG 
ATATGCATTT 
CCCAAAAAAA 
GCGCTAAAGC 
GGCTAGTGCA 
GTTGCTTTTT 
TAAGAAACAG 
T TTT AAGTAC 
CCACTCCGGC 
GGTATAAATT 
AAAAGTGAAA 
AGCAAAAAGT 
GTCCGATGTT 



CACACACACA 
CTTTTAATAA 
TGAATATGTC 
TGGCATAGAC 
GTCTATGCCA 
TCCAGTTATA 
TGTAGGAAAC 
ACATTACTGT 
TCCCTCACTT 
ATTGACAGTT 
TTATTGAGCA 
TTGTTTTGTA 
CAGAATATAC 
CTTCAAAAAC 
TAAAAGTTCC 
CCAGAGGCAT 
TCTATACACA 
TTACTTTCCC 
CTTTGGGGCT 
TTTGTATGCT 
AGACCCTAAC 
AAATGTTGTT 
ATATCCCCTG 
AAGAAGTGAA 
TCCTGGCTCA 
CAG CC AG AG A 
CGGACCCACC 
AGAATAAACA 
CCTGAAACAG 
GGCTGAAGTT 
GGTGAGATGG 
AAAAATTTAG 
GAAATTTAAA 
TATAGGAAAG 
TTTCGGACAA 
ACGTTATTAC 
CTAGCAATAT 
TCTGAGCAGA 
TTCTTTTCCA 
TAT CAT CATA 
GAATGAGACA 
GCTGGACCAG 
ATTTAACACA 
GCAGTTAGCT 
AAAGGGATTT 
AGGTTTGTAC 
TTAAACTACA 
GTCAGCTTTC 
CTTTGAATAG 
ATATTAGTGA 
CAGATTCAAA 
CTATTGTGTT 
GAGGCAGCAA 
CGTGTATAAA 
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106 921 TGCTCGTGGC TTGCTTTCTT TTCGCGTACC TGGTTTTTGT TGTCAGCTGG TTAGACATGT 

106 981 CTGGTCGCGG CAAAGG CGGT AAAGGTTTGG GTAAGGGAGG TGCCAAGCGT CACCGAAAAG 
107041 TGCTGCGGGA TAACATCCAA GGCATC AC C A AACCGGCCAT TCGGCGCCTT GCTAGGCGTG 
107101 GTGGGGTTAA GCGAATTTCC GGTTTGATTT ATGAGGAGAC TCGTGGCGTT CTCAAGGTGT 
107161 TTCTGGAGAA CGTGATCCGG GACGCCGTGA CCTACACGGA GCACGCCAAG CGCAAGACTG 
107221 TCACTGCCAT GGATGTGGTT TACGCGCTCA AGCGTCAAGG ACGCACTCTG TACGGCTTCG 

107 2 81 GCGGTTAATC TTTTCGTCAG TTTTCTT CCA ATGGCCCTTT TCAGGGCCGC CCACTCCCTC 
107341 TCAGAAAGAG CTGTGATTGT ATTCTTTCGG ATGGTAACAT CTCAATGGCT TTACTCGGCT 
1074 01 ATTCTGCCTA GTATGTAGAA CTATTATAAA CCAGTTGGGA GAGACCAGGT TGTTTGGTCT 
1074 61 GAGTGGCTGC TAAAGCAGAA ATCAGCTAAG TAAACGAGGT CTCCGAGATA AGTGAG CTAT 
107521 AAACTTCAAT GCTATAGTTT TGACATGTCA AGCAACTTAA CGTGCAGCGC GAGTCCGATA 
107 581 AATGAGTAGC TCAGCTTTTT AGTTTTAAAA ACGAGTTGTG CGTTATTTGT ACGAGAGCCT 
107641 AAGATGCTAG CTGCCTGGAA CTGAGTAGGT GGATTAAAAT GGGTGTCAGG TCTGTTTTCC 
107701 CAGGCGTATC TGACTTAACG TCAGCAAAAG CTGTACTTTT AGCTTCCCTG GTAACACCTG 
107761 CCGTCCTTAA CCGCCCCCTG CCGGTAGCGC CAGAAGCCTT TACTTCCATT TCTAGTTGAG 
107821 CTTGGCGTCC TGCTGAGTGA CGTCACCTCC CCCTTCTCTG GAGTAGGACT GGCGGTTAAA 
107881 GCTGCTTTGC TATTTTCAGT CCTCAGGCTG GAGGCTCCCC TAAGCAGGCT GCCTACGCAG 

107 941 TTCGTAAATT CCCACTTAGT AGACTAAGGG AGTCTGTTTT ATAAATAAGG ACTCAAATTT 
108001 CTTCTGACTC CGAGGTCCGT GGCAGCAGCT ATAAGATGGA AGCCCCCTCT GATGTAAGAT 

108 061 TCTCAGATGA CTTGCATCTT CACTGTACCT GTCAACCCAA TAGTCTTCTA TTCCTGCCTT 
108121 AAATTGTAAA TTCCAAAACT GATTTAATTG TGAAAGTTTC AAACTGTACG ACCTAGGAAG 
108181 TGTCAAAGTT AGGTGACCAG ATTTTTAGAA GTCAGCCAAA TATTCAGCAT CTTTGATTTA 
108241 GTAACAAATA TATTGATGGC TACTTCAGCA AAAAAAATCA ACTTTGTTTT CTGGTTACTT 
1083 01 TGCTAACAAG CTTCTCCTGA CAGGAGGATA TAGTGAATAG GCAGTTGAAT AAGTGAGTTC 
108361 GGGTGAGAGG TCTGAGCTGG AGATAAAAAT GTGTGAGTCA TCAGCAGATA AATAAATGCT 
108421 GAGACCAGAT GAGATGGCTA AAAACTGAAA CATAATGTAG TGCAGCATTG TTTGTAATAG 
108481 TAAATGAGTG GCAACTGTAA AGTTTTCATC AGAAAGGACT AGAGTGATCT ATACATCCAT 
108541 AAAATAGAGT ATTTCTCTAC ACAGCCCTAC TAAAGAATGA GAAAG CTGTA CTCCACTACA 
1086 01 TACTCTGGTG TACTCTGGCT CAGTTCTTGG ACTCCTCTTT TCTTGGCTAA CTCAACTGGC 
108661 CTCACCACTT ACATGCTCTG TG CTCTGTC A AATAGTTTGT TCAACAGAAC ACCACGGCCT 
10 8721 AGCTGTAAGT GCCACGTTAA CTTCTAGCAA TGCCAAAGCC TGTGATAGTG GCAGCTTCGG 
108 781 GCTGTTTCTC ATTCCCGGGA TGCCTAACCA CCTCTCCAAA TTCTATCAGT TTGCTTCCAC 
108841 CCACTTCAAG CTTCAGAACG AAACATAGAG CTTAAGAAAT ATAGGCCCGG CAAGGTGGCT 
108 901 . CACGCCTGTA ATCCCGGCAC TTTGGAAAGC TGAGCCTGGT GGATCACCTG GGGTCAGGGG 
108 961 TTCGAGACCA GCCTGGCCAA TATTGTGAAA CCCCGTCTCT ACTAAAAAAA AAAAAAAAAT 
109021 TAGCTGGGCA TGGTTGCGGG CGACTGTAAT CCAAGCTACT CGGGAGGGTG AGACAGGAGA 
109081 ATAGCTTGAA CTCGGGAGGC AGAAGTTGCA GTGAGTTGAG ATCGCGCTAT TACACTTAGG 
109141 CCTGGGAGAC AAGAGTGAAA CTGTGTCTCT AAATAAGTGT TTG CAATT AT AAACCATCTC 
109201 CCTGACCTTA AATCTCTAGA CTCATATACA ACTGCATATT TGATGTATCT AATTGAATAA 
109261 TGGGCATCTC GAACTTGTCC AAAATATGTT TATACGTAAA CACCAAGTCT GTTCTTCCTC 
109321 TGATATTTGT CATGTCAATC AATAGAACTC CATTCTTCAA GCAGCTTGGG CCAGGAATTG 
109381 TG C AATATTG TTTGTCCTGA GCTTCTTACA ACTTTCACCC AATGCAGTCA GCTCTGTTGA 
10 9441 AAATCAATCA GAATACCTTT CATTGTTTTC TTTGCTGCTT CTCTAGGAGC AAGCTGCCAT 
10 95 01 GGCGGTTTGT CTGAATGACC ACAGTGACCC CAAACTGGTC TTTGTTTTCA CTTTTAATCC 
10 9561 C C CTGTC AT A CAGTTTTTTC TCTATCCAGC ATCAACAGTG ATCCTTTTTG AAGGTATTAT 
109621 GTCCACTGTC TGCTGAAAAG ATTCCACTGG CTTTCCATCA CCTTCATAAT AAAAACCAGC 
109681 ATCCTTATCA TAGCCTACAA GTAAGATGAC CAACCATTAC AGTTTGCCTG ACTCTCAGGG 
10 9741 GTTTCTCAGG GTGTAAGACT TACAGTGCTG AAACTTAGAA AGTTCCAAGC AAACTAGGAT 
109801 GAGCTGCTCA ACCTACTAGA TCTGTACTCT GGCTACCCTC TGACCTCATT CTCTTCGCAG 
109861 TTCTTTCTCT TCACTGACCT TGCTGTTTCT GGAATGGACC AAGCATTTCC AGCATCAGCA 
109921 C CTTTATATC TATTCTTTCT CCCTAGAAGG GTCTTGTCCT GGATATCTGA ATGGCTCTAG 
109981 ATCTCATTTC ATTCAAGCCT CTCCTCAAAT ACCAACCTTA CGAAAGAGAC CTCCCATAAT 
110041 CATCCCTTGT AAAATAAGCT TTTCTGCTCA TTTAGCATAT ATATATATAG TTGACTATCC 
110101 TCAATAGCAT ATATATATAA CATTTCCCCA CCTAGAATTA TATATGTAAT AATATATTTA 
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110161 ACAAAAAATA CATATAACTA GATATATTTT ATTTTGTGTT TGTTCTCTCT CCCCCAACTG 

110221 GAATATATTT TTTGAAGGTA GGGACTTTGT TTTGTCCCAG AAGTATCCCT AGCACCTTGA 

110281 ACAGGGCTGA CGTTTAACAG GTAGTTTATG GAGGTTTGTT GAATGAAAGG ATGTGTGAAT 

110341 TTTCTATGTA AGTCTCCAGG CTCTCCACTA AGCCCACCAG AATGCTAACA CAATCAATTC 

110401 CCCATCTCAT TCCTTGACCT GCCACTGCCT GAAGCAATCA GCGTGCAGTT TCTCTTTAGA 

110461 AAATCTGGGG GATAGTCTAG GGGTTGCAAA TTAAGCAACA TTATCTTTGT TCTGAACAAG 

110521 GACTGCATGA GTGTTAGGAC TGAAGAAGGC CCAAGGTGGT GGTGGGTATG CCTAAGATGA 

110581 GTATGACATA TCAGCAATGC TATGAACATA GCAATGCTAT GAAAGGCCAG GCAAAACGTA 

110641 ACAGGAGCTA GTCGTGGCTT ATTGTTACAA CGACTATACC TCCCATATGG GTAATCGATA 

1107 01 TCCACACACC CCTCTACATT GACTCTGGAA TTCAGGAAAG GGAATTAAAA TTTTCTAACT 
110761 TATGTACCCC AATGATTTCA ACAATATCTG GCATATGAGA TCAATAAATA TCTTTAAAAT 
110821 AC CAACTAAG AAAGACATAA AATGACCCAC CCTCCATACC AGGCTCATTT TTGCTCCTCT 

1108 81 GATTCCTGAA ACTATCCAGA ATGCAGCTAT GAATTCTCTC CATTGTCAGT TTTAAATTAA 
110941 GCCAAGCTGG GTACTTGTGT AATTCCTCAA GAAATCCTGG ATGAAAACTG TCAGGTGGAA 
111001 AACAGGACCT CAAAATAAAG AGACATCCAT CACTGAAGCT AACATCGTGA GGCTGAAATC 
111061 AGTCCTATAA C AATGGT AC C AAAAAGAGCA CAATGAGAGG CATTTGTGAA TATTTACTCA 
111121 GATGAGAGTA AGATATTTCC CTATCAGCTA ACCTGAAGTT CACATCCCTT TTCCAGCTGA 
111181 GTTCTGAAGC TAGATGTACT TAACTGGAAC ACATAACTGC ATCAGGAACA TCCTTTAAAA 
111241 CTATGGCTAC CATGGCTTGA CTGGACAAAC CCCAGGCTTC CAGGTTTAGC ACAGGTGGCC 

1113 01 CTTCACAGAC CAACATTGCC TATGCTACCA ACCTCATGTC CTACCACCCT GCTTGCATCA 
111361 TTTCTCTCTC TGCATATATA AAAATATATG TGTATGTATA TAATCAGCTT TATTGATATT 
111421 TAATGTACCA CAAAATTTGC CCACTTTAGG TACAGTTCAA TGAATTTTAC CGTGTTTTCT 

1114 81 TAGTTGTACA ACCATCATCA CAATTTAATT TCGGAATATT TCTATCACCC" AAATTTCCAT 
111541 TTCTGCGTAA AGGGGGAAAA AAAAAGGTTA ACTGCTGAAG GCCGCGGTAA CACTGAAAAA 
111601 GGTGCCTTTT CTCTCTAAAA CAGATTTTAA TCTCCCCTGA ATTTAGTGTC CTGGGTATTC 
111661 CAGGAGTCTG AATAGGGTTT CAATTTTCAG GGTCTTTTTA ATAGAGTAAA ACTGTATTGG 
111721 TGGCGATAAA TTTAGTATTG CTCTCAGTAC ATGATTGAGG GATACTTAAA TGTCTCTGTG 
111781 ATTTTATTTC ATAATCGCTA AAAGATGGTT TTTTTTTTTC CTAAAACAGG GTTTTTGTTT 
111841 TTTCTCAATA AGCTTCTTAG CTTCCCCTCC GGCTCCCTGG CTTGCCTCAG GAAATATTAG 
111901 CTCATCAGTT CTGATTGGTT GACAGCTACG AATGGCCCTC ATTGATTGGG CAGCGCTTCT 
111961 TTGTCCCTTG GAAACTAATA CAAATTTTTA ACACTACTTT TTTTCCACTC TTTCTTCAGA 
112021 GTTGGAATAT CGTTGCTCCC CTACCCATAT GTAGTGAGTG GAGGGCAAAC TTGGAGTTCC 
112081 CCTAATCTTT CCTTTTTAGG ATGTCAGCTC AGTATCATTC ATCTTAATTA CACATTGAGC 
112141 TTCTTGACTT AATGGATACA GCTCTTCTTT TGTTTAGTTG GGCGGCCCTG AAAAGGGCCT 
112201 TTGGTTCAGA AATGCAAGCT GTGGAGAAAT CAGCAACCTT AACCGCCAAA GCCATAAAGG 
112261 GTGCGTCCCT GGCGCTTAAG CGCGTAGACC ACGTCCATGG CAGTGACTGT CTTGCGCTTG 
112321 GCGTGCTCCG TATAGGTGAC AGCGTCACGG ATCACGTTCT CCAAAAACAC CTTGAGCACC 
112381 CCGCGAGTCT CCTCGTAGAT CAGACCAGAG ATCCGCTTCA CACCGCCACG CCGGGCCAGA 
112441 CGCCGGATGG CCGGCTTGGT GATGCCCTGG ATGTTGTCAC GCAACACCTT GCGGTGGCGC 
112 501 TTGGCACCCC CCTTACCCAA ACCCTTCCCG CCCTTACCAC GTCCAGACAT GACTTCCCAA 
112561 GAAGTGAACC AAGAGCAAGT GAGAGAATAG GAAACCGATC TTTATATATC TACGTTACCC 
112 621 CTGCCCCCAC CTCCAGCGGA CACTGAGACT GAAAAGCGCG CAGGCGGGAA ATGTGACGCC 
112681 TACAGTCCGC TCCTTTAACC CCTCCTCCAA GCCCCAGGAA ATGGCGGGAG CAGCGATTGG 
112741 GGGAGGGTGG GGAGATGAGG GTGGGACCAA GCAGGCTTGA CCAATGGCCT TTATTTTCTT 
112801 AACAGAGCTA CAGGCTTTGA GGAACTGGGT TAAGAATTAA ATGTAAACCC ATTCTGACTC 
112861 CAGAATTATT TTAAGTCGAA CTTTTTTTTT AACCGAATCT CTCTGTCGCC CAGACTGGAG 
112921 TACATTAGAG CCATCTCGAT TCACTGAAAC CTCTGCCTCT CAGGTTCAAG TGTTTCTCCT 
112 981 GCCTCAGCCT TCAGAGTGTA GCTGGGATTA CAAGCGCTCG CCGTCGCGCC CGGCGTGTTT 
113041 TTGTATTTTT CGTAGAGACG GGATTCGGCC ATGTTGGCCA GGCTGATCCC GAACTCCTGA 
113101 TTTCTGGTAA TCCGCCCGCC TCAGCCTCTC AAAGTGCTTG AATTACAGGC GTGAGTCACC 
113161 GCGACCGGCC GAAATCGATT GGTTTTGAAG CCTTCAGTAG CATTAAAACG AAAAGTGCTC 
113221 CCAATGCATT CCCTTTTGTC TTAAATTGGT TTCTTACAGC TACTTTACTT GAAAAGGTGG 
113281 TGGCTCTGAA AAGAGCCTTT GCTTGGACCG TCAGAGAGAC CACAGTAATC ACGCCCTCTC 
113341 TCCGCGGATG CGGCGGGCGA GCTGGATGTC CTTGGGCATG ATAGTGACGC GCTTGGCGTG 
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GATGGCGCAC 
CAGAGCCATC 
GCGCACCAGG 
GCGGATCTCG 
GGTGGCCGGA 
AC C AGTAG AC 
AACACCCAAC 
AGTGATTGGA 
AAGCCAGCAA 
ATTATTTAAG 
AGCTGGTCTT 
TTACAGGGGA 
CATCTGGTTT 
AAAGGTATTA 
AAAAAAAAAA 
TCTGAAACCT 
ATTGAAATCT 
CTATGAGACG 
CTTTTGTTTT 
TGCCGCCAGG 
CCAAGAGAAA 
CAATCATTTT 
GAGACAGGCC 
AAAGCAGATC 
AACCCAATAG 
TAATTAGGAG 
AAGAAAGGTT 
CTGTGTTTCA 
CAAATTTTTA 
AGGAAGCACT 
TGGCACGTTG 
TTTGAAAACT 
TGTGCTAGGG 
ATTCTCCCAT 
TTGAAATGCA 
CAGGGTGTGC 
AGCAGAATGC 
TTGTGAGGCC 
GATTGCGTGT 
GTATGGATGA 
AAGTTGCAAG 
GAATTCTGGT 
TCGCTCTAAA 
CTAAAACTCG 
TGCAAATTAA 
CGGATGCTTG 
GCTGCTTTTT 
TTTCCTTCAC 
AGCAGGACCA 
CAGCGCGCAG 
AGTCCCACCC 
TTATACCACT 
CCCGCCGCTT 
AAGGCTG C AG 



AGGTTAGTGT 
ACAGCGGAGC 
CGCTGGAAAG 
CGCAGAGCCA 
GCGCTTTTGC 
TTCCGAGCAG 
ACTAGCGCAA 
TGATAGAAGA 
CAATCGTGCA 
TGGTATTTTA 
AAACTTGGGC 
GCCCCACTGC 
TCATAACCTG 
TAATTCCCCA 
GAGGGAATAC 
TTCACAAGAA 
ACAAAGCATC 
TCTTTCTCTT 
CTAAAGCCTA 
TACCACCAGC 
CTGGATAGTG 
GAAAATCTCA 
AC ATTCTAT C 
TATCATCCTT 
AAAAAAGGGA 
TATTTCCTTT 
TATATCTTTC 
TAACTGACTA 
GAAATTACGT 
GGTGCAGAAG 
TTTGAACAAA 
ACCACAGCAG 
GGTTATCCAG 
TAGCTGAGTC 
CTTAACAGCC 
TTGCATTTAT 
CTGTCAGGGA 
CATAAATATT 
GCTGACATGG 
AAAGGGCATT 
TGCAGAAACG 
GTTGTCTACG 
ACATTGCCAG 
CACTTTTCTC 
AAACTAACAT 
TGGCACTGCA 
GAGAGAGAAG 
AATGAGGCAA 
TAGGCCCTAG 
GGGGCGCTAG 
TATAAATAGG 
TTATTTGGTG 
CTGCTGCTCC 
CAGCCTCCAA 



CCTCAAATAG 
TCTGGAAACG 
GTAGTTTACG 
CGGTGCCCGG 
GGGCTGCCTT 
TTTGCTTAGT 
ATACGCCCAT 
CGCTAAATAT 
GTTTCACCGG 
TTACTACTAT 
TCAAAGGATC 
GCCGGCTTGG 
AAGGCTGTGT 
ATTCCGTATA 
TGCTCACCTC 
TTGGATTCCT 
TCAAACATAG 
GATTATGCTC 
GGTGTACTCT 
TGGGAGTTGT 
GTTCGCAAGG 
AAACACTGAA 
TTTTGATTGG 
CATTTGCATG 
GGCAGAACCC 
TCAAAAGTTG 
ACAAAGGGTT 
GCCGTCAAAC 
GAAATATTTG 
ATGGGTACAA 
AAGGGGAAAA 
GAGCGGAAAT 
AATAGGATTG 
TCTGATTGGC 
ACAAACAAGT 
CACCCGTGTT 
ACCGGTTTCG 
TGTTGAATAA 
AACACAGGTT 
GGAAATTCCT 
TTTCCACACT 
ATTCATTCTT 
AAAATGTAAT 
TCCCTCCGCA 
CTCTGGCAAC 
TTTGTAAACC 
CGGTACCCTC 
CCAGAGCGGC 
AGGCCCCCAG 
GGCGCGAGGG 
CTGCGTTGGG 
TGCTGTGTTA 
TGAGAAACCT 
GAAAAAACCC 



CCCTACCAAG 
CAGGTCTGTT 
AATAAGCAGT 
CCGGTAGCGG 
AGTGGCCAAC 
GCGAGCCATG 
GAGCTGCTCT 
GACGTTACAC 
CTACTATATT 
TATTTTATTT 
TTCCCGCCTC 
ACTTTAATTT 
TTATTTTCCA 
AC CTTC AGCT 
CTCTCCGGAA 
TTGTAATGCT 
TAGGATTACA 
TTTGAATCCT 
GCCAGTCACA 
TCCTCTTGCG 
AACATAATTT 
AAGTGGATTG 
TTTAGGCTAT 
GAGCGTTCCC 
ATTATTTAAA 
CGTTTTTTCA 
TACTTACAAA 
CAAGATGTAG 
AATGCATGCC 
TACTTATCTG 
GCTCAGGTTA 
AAGACCGCAT 
TAGAAGTGGA 
AATGTGAGAT 
TAAAGGGTTG 
TGCTTTCACA 
TGGACCCAGC 
AAGAATGAGT 
GTAAACCTTA 
GAAGTGCATC 
TGCAGTTTGA 
GTTTGACGTG 
AGAGTTGATG 
ACTATTCAAA 
GGACCTCTAA 
GCCCCCTCTC 
TGATGTTACT 
TTTTTCTGTG 
CTGCCTTCTG 
GCGGGCACTG 
GCCT.TTTTTT 
GTCACCATGT 
TTAGCTGGCA 
GCTGGCCCTT 



TAGGCCTCGC 
TTAAAGTCCT 
TCAGTGGACT 
TGGGGCTTTT 
TGTTTGCGTG 
ACGGAAAAAC 
ATTTATAGTG 
ACTCTGATTG 
CTATTCCAAC 
TACTTTTGCT 
AGCATCCAGA 
TTTAAACTTG 
TAAAACAAGG 
CTTTAGGAAA 
ATGTACCCTT 
TTAATTGACT 
CTATTACTCA 
AAACTTGCAG 
AAATGGCGTT 
GAGCAGGAGG 
AGCATTGCCA 
TGACCTTTTT 
TTTCTTGAAC 
ATTTTATTTG 
GTGGAAACTC 
GATACCTCGC 
AATCTTCCAA 
AGTTTCCAAC 
TTCTCAATAA 
GGACCACTCC 
CTTAGCATGG 
TACCTCACTC 
TGTCGATTTA 
CGTTTTAGCT 
TTACCATAAA 
CTAAGTGGAC 
ATTTAACGCC 
TGACCATGTC 
ATACCAATTT 
CCACATTGGA 
GTATTAATTG 
AAAGGTATTC 
ACAACTGGCC 
ACACTGTATT 
AAATTTCTAA 
AACCTACTCC 
GGGCGGCAGT 
TGTTTGCTTG 
AGACTGGGCG 
ACGGGCACCA 
CGCATCCTGC 
CTGAAACAGT 
AGAAGGCAAA 
CCGTGTCAGA 



ACGCCTCCTG 
GCGCAATCTC 
TCTGATAACG 
TCACGCCGCC 
GCGCCTTGCC 
AGCACAGCGG 
TGTAAAGTGC 
GTCTATCTTT 
TCTACAGATG 
TTGTTCCCCA 
GTAGCTGGGA 
TCCTCTTCTA 
CATTGATTCC 
AAAAAAAAAA 
TACGGGAATT 
TAGGAGTGTT 
GAAACATTTT 
CGTTCTGCAG 
TCTCCAGCAC 
TGGACTTGGC 
AGAGCTAATG 
AAATTCACAA 
AGCCATTTAG 
AAACCAGTTT 
CTGAATCAGA 
TTATTACACT 
TTTTGTATAC 
CGTTATTTTC 
AATGGGACGT 
ATTATTTGGT 
TTCGGACTTA 
TCTGCTGTGC 
ATAGTTTTTT 
TATTGATACT 
ATCTTATCCC 
TTAACTCCCC 
TTTCGCAGGC 
ATGGTGCGCT 
GGGGCATGTT 
CTGTGGAAAT 
C AG CGTTTGT 
GCGAGACACA 
CTAACACGGC 
TTACATTTCT 
TAAAACTCCT 
CTAAAAAAGA 
CTGCCTACAA 
CGTTGAGGGG 
AAACCCTCGG 
ATCACGGCGC 
TTCGTCAGGT 
GCCTCCCGCC 
GAAACCTGCT 
GCTGATCGTG 
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CAGGCTGCTT 

CTGGCGGCCG 

AGCCTGGTAA 

AAGCTCAACA 

AAAACTAAGG 

AGCGTCAAGA 

CCAAAAAAAC 

GCTGTAAAAC 

AAAGCGGCAC 

CTTTTAAGAG 

AAGCAAATG C 

ACATAGCCTC 

AAATGCACCG 

GCGTCTCACG 

CCCAGGCTTG 

ATATTGCTAA 

TGGCAAAATA 

CTTGGGAAGG 

CCCCGCCAGG 

ACGCAAAATG 

GGTTTAGGCC 

AAAATCCCTT 

AAATTACTGT 

GAGCCCACAC 

ATTTTAATTC 

TATATGATGA 

TTTGCGTGTG 

TGTTTTGCTT 

TCCCCCTTCC 

ACCTATTTTT 

TC TAT AT AC A 

ATCTTTTGAG 

CACACTGTAA 

CTGGGATTGC 

GTCACACCAT 

AGTGTTGGGA 

TAGGGTTATT 

ACTTTAGGAA 

ACATGGTGAA 

ACTTATAGTC 

GAGGCTGCAG 

CAGTCTCAAA 

TTGCTTTTCT 

TGGAATAGCA 

ACTGCTGAAC 

GTATTTTATT 

TGGTGGAGTA 

TCATGCCTCT 

TTTTTGTATT 

TGGCCTCAAG 

ATGGCGCCAG 

GTATCACACT 

TTCTTATTTT 

CTTAGAGCTA 



CCTCCTCTAA 

CAGGCTACGA 

GCAAGGGAAC 

AGAAGGCGTC 

CAACGGGTGC 

CTCCGAAAAA 

CCAAAACTGT 

CCAAGGCGGC 

CCAAGAAAAA 

CCACCTACGC 

AACACGCCCT 

ATCTAGTAAG 

AGTTAAAATC 

ACTCCAAGCT 

AGGCTTTTTA 

AGTAGCATTT 

TATGGCTTAA 

ACGGTG AC C C 

TAGGCCTAGC 

CTTTCTTTGT 

CTTGTCAGGA 

GCTCGGTTTC 

GCTTAACTGG 

ATTCAAAACT 

TGGGGGTCAG 

GCCAAGTTTA 

TGCATATGAG 

AGCTCCTTCC 

TCAAACTAAT 

GTGAGAAATA 

TGCATACTTG 

ACTGGGTCTC 

CCTCCAACTC 

ATGCACGCAC 

GTTGTCCAAG 

TTACGGTCAC 

AATTTAAAAA 

GCCAGATGGG 

ATCCCATCTT 

CCAGCTACTT 

TG AG C AG AG A 

ACAAAAATAA 

CTTAACCAAA 

TAAGTTATTC 

ATGGTGCAAT 

ATTTTATTTA 

CAGTGCCATG 

GGTCTCCCGA 

TTTAGGAGAG 

TGATCTGGCC 

ACCTGGACTT 

GAAGACTGAT 

TTAATTTTAA 

GAGAAGCCTT 



GGAGCGTGGT 

CGTGGAGAAG 

GTTGGTGCAG 

CTCCGTGGAA 

ATCTAAAAAG 

GGCTAAAAAG 

AAAGCCCAAG 

CAAGGCTAGG 

GTAAATTCAG 

ATTTCAGGAA 

CAATTATATT 

AATTTACTAC 

GAGTTTTAAA 

AATTAGTCAT 

TTATACAAGG 

TCCGAAATTG 

CCACGCCCTC 

TGCTGGCGTG 

TCGCTTGCTT 

GCAGCGCCTT 

CAAAGGAGCT 

TCTGTTTTTA 

ATCGTGTTTC 

GAAGATCCTT 

ATTTTAATAA 

CTCACTTTTA 

TTAATAATCA 

ATCGTCTAAA 

ATGTAGCTAC 

TACATGTTTT 

TGGTTTTGTT 

AGTCTGTTGC 

CTGGGCTCAG 

CACCAAGCCG 

CTGGTCTAGA 

TGTGCCTGGC 

ACAAAGCCTG 

CAGATTACTT 

GACAAAAAAT 

GGGAGGCTGG 

TCGTGCCACT 

TAAAAATTTT 

CTTTTCTAAA 

ATCCATTTCT 

TGAATAGAAT 

TTTATTTATT 

ACCTCAGTTG 

GTAGCTGGGA 

ATGGGGTTTC 

ACCTCGGCCT 

TGTCTTCTGT 

GATTCTATAT 

GGCAATTTTA 

GGAAGTCATT 



GGTGTGTCGT 

AACAACAGCC 

ACAAAGGGTA 

ACCAAGCCCG 

CTCAAAAAGG 

CCTGCGGCAA 

AAAGTAGCTA 

GTGACGAAGC 

TTAGAAGTTT 

AAGAGCTGTA 

AGAATCACTT 

TCAATCTATC 

GTCACCTGGG 

AACCGTATTG 

TTAAAGTGGG 

GGTGGTCCTA 

TCCACAGGAG 

GCTGGCGCCC 

TCTGCAGCGC 

ACCATGGGTG 

TAGTTTGTTG 

GAAACGGAAG 

ATCAGTCGTG 

TTCTCAGAAC 

TTGGACTTTT 

CTTAGTGCAG 

GTTGTATTTT 

GTCAGGGATA 

CTAGGTTTAT 

TCTTTGAACT 

TTTTTAAAAA 

CCAGACTGGA 

GCTATCCTGC 

GGCTTTTTGT 

AATGGCCTCA 

CTTGTATGCA 

GACGCAGTGG 

GAGCTCAGGA 

ACAAAAAATT 

GGTGGGAAGA 

GCACTCAAGC 

TTACAACGAT 

ACCCTGTCAT 

TATTGATAAG 

TCCAGGGCTG 

TATTTAGACA 

CAACCTTTGC 

TTACAGGCAC 

ACCATGTTGG 

CCCGAAGTGC 

TTCATCAGTC 

AAATATGGTA 

GATTCCAGCT 

TAGTTTTTGT 



TGGCAGCTCT 

GCATTAAGCT 

CCGGAGCCTC 

GCGCCTCAAA 

CCACGGGGGC 

CAAGGAAATC 

AAAGCCCTGC 

CAAAGACTGC 

CTTCTAGTAA 

GTACACAGAT 

GGAGAGTCGA 

AAAGATAGCA 

TTTCGGTAGC 

AACCAAGGTT 

GATATTGCGT 

AGAAATGCTT 

TGGCTAGCGA 

ACGTTGGCGT 

CATCATGACA 

CACTTACGGG 

GAGTTTTAGA 

CGCCCTGATT 

CAGGATTTTC 

TGCCCCTTTA 

TTGTTTACAT 

TTCAATTCTA 

TCAAACGGTC 

CAGGCACATC 

CCTTTAAAAC 

AAGTATTTTA 

AAAAAAAAAA 

CTGCAGTGGC 

AGCCTCAGCA 

TTTTATTTTT 

AGTGATCATC 

TAATTGTTTT 

CTCACATCTG 

GTTCAAGACC 

AGCAAGGCCC 

TGACTGGAAC 

CTAGGTGACA 

GTTATATACA 

GAAAAAAGAA 

CATTGATGTT 

AG ATTG CT AG 

GAGTCTTACT 

CTCCTGAGTT 

CTGCCACCAG 

CCAGACTGGT 

TGGGATTACA 

CTTCTGTTGG 

AAGACTGTAC 

TTCCAAAGAA 

TTTGTCAGAG 



TAAAAAGGCG 

GGGCATTAAG 

GGGTTCCTTC 

GGTGGCTACA 

TAGCAAAAAG 

CTCCAAGAAT 

TAAAGCTAAG 

CAAACCCAAG 

CCCAACGGCT 

GAAATCCCCC 

TAGAACTTTA 

AGGTGAATTC 

CGGAAGTCCC 

GAAGCCCAGT 

TTTGGGGTCA 

CTGGGATAGT 

GCTGTCTGTC 

CCTCTGAAAG 

AAGCTTTGAA 

CTGTCGACTT 

GCTGCAACCC 

GGATATTTGA 

AACCCTGGTG 

AGCTTTTGCA 

CTGACAAGAG 

AAAGTTTATT 

TTTTTTCAAT 

ACATCCCTGT 

AAAAATTCTC 

CATACACCTA 

AAAACACGTT 

ATAATCACAG 

TCCGGAGTAG 

TGTGGAGACA 

GACCTCCCAA 

GTCTTTTGAT 

TAATCCCAGC 

AGCCTGGGCA 

AGTGGCACGC 

CTGGGAGGTA 

GAATGAGACC 

CTTCTGCATG 

ATCCTTCACA 

TCCAGTTACC 

GTTTTAGGTT 

CTGTCACCCA 

CAAGCGATTC 

GCCTGGCTAA 

CTCAAACTCC 

GGTGTGAGCC 

TTCAAGCACA 

ACCCTAACTG 

TTGTGGAATG 

AAAATTCTGT 
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AGAGACTCTG 
GGCAATAATA 
GCAAAAGAGT 
AACTGATTTA 
ATAAGAACTC 
GAGAAAATAG 
TTATAGGATC 
AGCCTAGATC 
AGGAGACATG 
AGGAGTACCA 
TTATTCTTTT 
GTCATTATTT 
AAGCAGTCCA 
TTAAAATCCT 
GAGAAGCATA 
GAACACAAAG 
CCCTCCCAGA 
ATAACCCCTA 
AATGGAAATA 
TTTGAATAGA 
TATTCCATGT 
TTTTCAGCTA 
GAATACATAT 
TTTGAAAGTG 
AACAGCATGG 
CTTCCATATA 
TCCATTGTGG 
TTTTTTCTTA 
TAGGTAGCTT 
CTAAATTGAG 
AGTTGCTGTC 
GAATACAGAT 
CTCTGGCAAG 
ACAGCAGTCG 
GTCTCTTCCT 
CTCATTTAAC 
ATTGGGTGTT 
TGTTTTTTCT 
TTGAGGTGGA 
TGCAACCTCC 
GTGCATACCA 
TTGGCCAGGC 
ATGCTGAGAT 
AAATTTTGCT 
GTCTTCACTC 
CCTCCCAGGA 
AGGCCACCAT 
TGGTCTCAAA 
TAGAGGTGTG 
AATTTAGTGT 
TTTAAAAAAT 
GAAATTAGTT 
GTTCCCAATC 
GTTAGCCTCA 



TCCTGCTCTC 
CCTTATGGAC 
GAGAAGGAAA 
TCACTCCCCC 
CTAACTCCAA 
GTTTGGTAAG 
ATTCAATTAT 
TTGCCTGTAT 
TTGAAAGTGA 
TGGGAAATAG 
ATTTATAAAG 
TAATTTCATA 
TAAACCTACA 
TCTCATTGAA 
ACATATAAAC 
TGAAC AC CAT 
GCCGCCTGAT 
TTTTGATTTC 
AAACAAATGT 
TTCATCTGTG 
TATGAGTGCA 
ATATAAACAA 
TTTGTTTGTA 
GTGATATTAG 
ATGAACCTCA 
CTTCCCAATA 
CAGATGTGCA 
ATTGGATTGT 
GAACAAAAGG 
GTGCCAAGAG 
TTAGATTGTT 
TTATTTCTTA 
GGCCTTCTTA 
CCTTTTGTGT 
TTTCTTATAA 
CTTAAATATC 
AGAGTTTCAA 
TGTTGGTTTA 
CTCTTGCTGT 
ACCTCCTGGG 
CCGCGCCCTG 
TGGTCTCAAA 
TACAGGTGTG 
CACGAACCCT 
TGTCAGAGGC 
TCAAGCGATC 
GCCCAGCTAA 
CTCCTGAGCT 
AGCCACCACT 
ACTCAAATTA 
CATTTTCTAT 
CTCAAGACTA 
AACCTTAGGT 
TGGTTACAAT 



ACTGAATACC ATCCCATAGT 
AGTATGCTTT TCCTCAAATA 
GTAGAATAAG TTATCTAAGA 
TTCCAAACTC TCTTGAAGGT 
GGGAGGAAGG TAAGTTATTC 
CATCCGCTTT CTGCTACCAT 
TGGTTGGCTC TTGAGAGGGA 
AGAACCTCAT GATGTTATGC 
CCCATAAATC TGCAGTATCT 
CATTAGGTCA ATGACAGTAA 
TTGTTAATAT GCTACATAGT 
TTTCACTATT GATAAATGAA 
TATAAAGCAA ATTGGAGATT 
AAAAAATTTC GTATTAGAAG 
AGAAAACCAC AGCAAAACAA 
AATAATTGCC ACACAAGTAA 
GCTTGCTTCC AGTCACATTA 
CAATGCTGTA ATTTAGTATG 
AATCCTATGT ACCTGACATA 
TTGCTGTGTA TAACTTTAAT 
ACAATTTAGG TGTCTACTGT 
TACCGTGAAT ATTCCTGTGT 
TACCTAGGAG AGGAATTGTT 
GTTTACATGC GATGAAATGA 
CAAACCTAAT GTTGATGGAA 
TTTTTTTCCA ATTAAAATTG 
GTATTATCTC ATTATGGTTT 
ATATCAGTCG ACTTGGGCTG 
AGTTTATTAC CTCACAGTTC 
ATTCAGTTTC TAGTGAGGGC 
TGGTGCTGAA CAGAATACCA 
CAATTCTGGT GGCTATAAAG 
CTGTTATGGC AGATGTGAGA 
CCTCATGTGG CCTCTTCATA 
GGACACCAGA TCTATCAGAC 
TCCATAAAGT CCCAAAATCC 
CAT CAATTTT GGGGGAACAC 
AGATAGCTGT CTTTTTGTCC 
GTCACCCGGG TTGGAGTGCA 
TTCAAGAAAT TCTCCTCCTC 
CTAATTTTTG TATTTTTGAT 
CTCCTGACCT CAGGTGATCC 
AGCCACCAAA CCTGGCCTGT 
TTATCCATTT TATGTGTTGC 
TGGAGTGCAG TGGCACAATC 
CTCCCATCTT ATCCTCCTTA 
TCTTTGTATT TTTTTGTAGA 
CAAGCAATCC ATCAACCTTG 
GCACCCAGCC AATGATATCT 
AGCACACTGC CCTTTTATGC 
TTCAAGGTCA TGAAGATCTT 
CCCTCACTTC TAACACCAAT 
TAGTAATTTG CTAAAAGGAC 
TTATTATAGG AT AT AT AG CT 



ACCCCCCAAC 
TATTCTAAGC 
ATCAGTGGGT 
CACTCTGCCT 
TTATTCCTTG 
TCTCTGTGTT 
ATGCAAGGTT 
TTCTCTAAAA 
CATGTCTCTC 
CAACTCCCAG 
CCCTAATTTT 
GGAAAAAATG 
TTAAAATTGA 
ATTTCAACAT 
AAATGCAAAG 
AAAAACAGAA 
TCACTCCATC 
CCTGTTTTTG 
TTTCACTCCA 
TCATTTTTAT 
TG ATG CAT AT 
ATGTGTCTTG 
GGGTCAAATG 
AAATTAAAAC 
TCTAGCTGGG 
TTAATCTTTT 
TATTTTACAT 
C C AT AAC AAA 
TAAAGGCCAG 
TCTCTTATTG 
GAG AC C AAAT 
CCTATGGTCG 
TGTCATCTCA 
TGCCCATAAA 
TACTGGCCTA 
CTATCTCCAA 
AATTTAGGCC 
TTTTTGTCCT 
GTGGCGCTGT 
CCAAGTAGCT 
AGAGACGGGG 
ACCTGCCTCG 
CTTTTCTGTT 
AGGTATTTCC 
ACAGCTCACT 
GTAGGTGGGA 
GATGGTGCTG 
GCCTCCCAAA 
CATGATGCAT 
AC AAC CTTTT 
ATTTTATAAT 
TATAAGTTGG 
TCACAGAACT 
TATTATGTCA 



AGCTTTAAAG 
CATGGTCAAT 
GCTCTCTTTA 
CCCTTTCTAC 
CTTAGAAAAA 
TCTGTGTTTT 
CAAGGACACA 
TGAGGCCTGG 
AATG GGGACA 
GTGAGTTGAT 
G CCACAAAT A 
AGTAGCAGTT 
TTCTGGATGC 
TCTTTAAACT 
CTCAATAAAT 
AATCAGCCAA 
TGCCCTAAAC 
AAACATATAA 
GAACATTAGG 
TGTTATGTAA 
TTGCTTCCCT 
GTATATATAG 
CTAAACTCTT 
CACAGTTATA 
AATTCCTGTT 
GAAGATGTTA 
CTTTTGCCCA 
AATACTAGAC 
GCCAGAAATC 
ACCTGAAGAT 
AATTTATAAA 
AGGGGCCCAC 
TATTCAAACC 
ATGACCTCAT 
CTCTTATGAC 
ATATAGGCAC 
AAAAAGATTG 

yji f^p fj< i^i I J 

CTCAGCTCAC 
GGGACTACAG 
TTTCACCATG 
GCCTCCCAAA 
TTAAGTTTTT 
TCTGTAACTT 
GCAGCCTCCA 
CTACATGTGC 
TTGCCCAAGT 
GTGTTGGGAC 
TAAAGTCATT 
TTGTATCTTA 
ACCTTCTTGT 
GAGGTCTGTG 
TGCTGAAGCT 
TTCCAATGCA 
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ATGTAAAATT 
TGAGGTATAC 
TTCATTGTTT 
TGAGAAAAGA 
CCCAGAAGAT 
AATAGGATAG 
GTGACTTAGA 
GTCACGTGAA 
ATGTCAAAGG 
AGGCTTCAGT 
CTACAAGAGA 
GCCCTAGGAA 
CCTCTCCCAG 
ACGTACAGAG 
CTGAAACCAT 
TGATAAGGCT 
ATTGTTAAGA 
ATTATTTTTG 
GAACTGTATT 
CACATGGCCG 
TCCTATTATA 
ACACTTATCA 
AGGCTAGACC 
TGTTTAACTT 
TGAAGTAGGG 
ATACAATTGT 
CTTTAGATGT 
TGGACAACAC 
ATTTGTTTAT 
TATAATAAAG 
GACAATTAAT 
AGTTTTCCTG 
CGTAGAAAAT 
GCCGATGTGG 
AACCTCATCT 
CCAGCTAATC 
TGAGCCAAGT 
AAAACAAAAA 
ACAGTTATTA 
ATGGCCAGTT 
GAAAATCTTT 
CGGGTACAGT 
CAGAGCACAA 
AGTTAGACTT 
CTGCTCCTTG 
CGTGGCCTCA 
CCTGAAACCA 
TTTCTTTAAA 
ATTGAACCCA 
CACCAATTAT 
TATTTCTTTT 
CTACAGTAAT 
CAGAACTGGA 
AAGAGAGTAG 



ATACAACTAC 
AAACAGCAGA 
GTAATAG TTG 
AACACACAAG 
ATGTTTAAAG 
TTGCAGATAT 
AGAAGTAGTA 
GACAGACACA 
TTG C C AG C AA 
GG AAG CAT AG 
ATATATTTGT 
ACTAATATAG 
TAATATGAAC 
CAGTCCCCAC 
GGATAGTACT 
TAATTTATAA 
ATATACTGTA 
GACTGCAGTA 
TCACCCGAGC 
ATCTTTTAGT 
AATCACATTG 
GTGAGAACAT 
TCTTTTTGGG 
TATTTTGCTT 
GGTCAAGTTT 
CCCATGCCAT 
AAATCAATGT 
TGCCCTGATT 
TTTGGGCCCT 
CTTATTGGGA 
ACCTTTAAAA 
TTTAATTTAT 
TCAAAGCCCA 
GTGGATCACC 
CTAGTAAAAA 
AGGAGACTGA 
TCCTGTCACT 
AAAGAACATT 
AAATATCAGG 
AGAGAAAAAA 
GAGAGCCTTG 
TCCTTGTTTA 
GGACCTCCAT 
CCAAAGAATG 
GAGTTCGGTT 
GTGTCCGGAT 
C C AC AAGT AT 
AATTCAAATT 
CAGCTTTTAA 
TTAATTATTG 
ATATTGCATA 
AGTTCAAAAG 
GGCAAGAAAA 
CACTGAAATT 



TTTTAAAAAG 
TATGTGTGCA 
AAAAGGGGAA 
GCAGTATTAT 
TCCCAACTCC 
AATTAGTTAA 
TATATATATT 
CACAGGCAGA 
GTACGAGAAG 
ATCTAATGAT 
TGTTTTAAGC 
GCACAATCCA 
AGTATTACTT 
TTATGCACAA 
GAACTCTATA 
ATTAGGCACA 
TAAAAGTTAG 
G AC CACAGGA 
CTCAGTGTGC 
CTACCTCCAC 
TTGACTGTGT 
TTCAAGGGTC 
TAAGATAAAT 
TTCATGTTAG 
CTTTTTTTTT 
TTATTTACAA 
CCATATTTGT 
ATTGTCATTT 
TGGATTTGTG 
ATCTGATTAG 
TATTGAC CGC 
CTGAGTAATA 
AGTG CAATAG 
TGAGGTCAGG 
TACAAAAATT 
GGCAGGAGAA 
GCACCCCACC 
CAAATAATCA 
AT AT AAAAG C 
AAAAGAATAG 
GCCCTGCCCT 
AAAAAATTTT 
AACACCGGAC 
GTGTTTCCTC 
TCAAATCTAC 
GTACGGTGGC 
TGTTTCATGT 
ACATATTTTG 
AACCTACTGA 
ATCAATATTG 
TTTTATATTT 
TGTACATCCA 
TGTCACAGTA 
GAAGAAAAAT 



ATTTTAGCAT 
CAT AT AT AC C 
ACAACTCAAG 
GGATCGAATT 
CAGTACCTCA 
GATGAGGTTA 
TTTTAATAGA 
GACTGAGGTT 
CTAGGAAGAG 
ACCTTCATGT 
CACCCTAGCT 
GGCAAGTTCC 
TCCCAGCATT 
AACATATGTT 
TAGCTGTTTT 
GTAAGAGATT 
GTGAATGTTT 
ACTAAAACCA 
AGTTTTAATG 
AGGTAGAGCT 
GGTGGTCAAA 
TAAAATTCAT 
TTTTTACCAT 
TTCCCCTGGA 
CCTTTTTGTT 
GAGTCCTTTC 
TTGAGCCTGT 
TATCAGTTTT 
TATTAAATTT 
GATTACAATG 
TTCAACTGTA 
CATTATAGTT 
CTCATGTCTG 
AGTTTGAGAC 
AGCTGGGTGT 
TCGCTTGAAC 
CTGGGCGACA 
ATGTAGATAA 
AAAAAAATCA 
GCGAGACTTA 
CAGGGATTTC 
GCTCCATCAA 
ACTAGATGTC 
TGTCCCCAAA 
AAGGCTGTCA 
CTTGGCACCT 
TATGTATGTT 
CAAGCCCCTG 
ACACTTTGCT 
TTTCCTTAGT 
CTGCATTACA 
AAATTTAGCT 
ATTCTAAAAA 
AGATGCGTTT 



TTGACCCAAC 
AAGACACATA 
GAATAAAGAT 
GTATGCAGAT 
GAATTGTGGC 
TAGTACAGTA 
ACTAGTATTC 
ATGCAGCTGC 
TCAAGGAAGG 
CAGATTTCTA 
TCTAGCTCTT 
AAATATGAGC 
AATGTGTGAC 
CCAGGACCTC 
TTCCTATACA 
AATAACAATA 
ATTTCTGAAA 
TGTAGAAACC 
GCCTGCCATG 
GATACTGTGT 
ACCTCCAGGT 
CTCCCAGTAG 
ATACTTTATT 
ATTGTTTTTT 
CTTTTTCTGT 
ACCATTGTTG 
TCCATTCGTT 
GATATTTAAT 
GAACCCTGTT 
GTTTTGTAGA 
AATATACTCC 
TTCTTCGTAG 
TAATACCAGC 
CAGACTGGCC 
GGTGGCGGGC 
CCAGGAGGCA 
GAGCGAGACT 
TTCAAATAAC 
ATAACCTCCA 
AAAAGGCTGG 
TCTGGCTTCA 
TCAACAAGGG 
TAAGGGACAC 
CTCTGGAACT 
TGGAGGTTGC 
GAATGTGAGA 
TTTTCTTATC 
AACAAGCTTC 
CTATGTTGTC 
GTTGGGATCA 
GTTATTACAT 
GTGAAGTGGA 
AGATGATGTA 
GAGAGAAAAT 



AATTTCACTC 
CACAGCAAAA 
TAAAATCAGC 
CTCCCTTGCC 
CTTATTTGGA 
TGATGGGCTG 
TTCTAAGGTG 
AGGTCAAGGA 
ATTTTCCTAC 
GCTTCCAGAA 
TGTTACAGCA 
TTCCAGTTGT 
AATACACATG 
CAGTGGATGT 
G AC AC AG CTA 
AATTAGAATA 
TTTACCGTTT 
GTATACAAGA 
GTTGACTGCT 
GGCTCAAAGT 
AAACAAAGAC 
CTGAGGGCAA 
TTGCTTTTCA 
GTGTATAGTG 
TTAAAAGGCT 
TATGGTGCCA 
TGTCTATTTT 
AAAGCAACAG 
TGTCAATTTC 
TCAGTTTGGG 
TCCATTATTT 
AAGTCAGATA 
ACTTTGGGAG 
AACATGGTGA 
ACCTGTAATC 
GAGGTTGCAG 
TCGTCTCAAA 
TAAAAAATGA 
TATATACAAA 
GAATCTCCCT 
TGCCCAGATA 
GCTCCTTCCT 
CTCTTAAGGA 
CACAGCACAA 
AGACCAAGTC 
ACATGACCTC 
TGAAATTCCT 
ATGAGCATTT 
ATTCACTATC 
TTTATGCATG 
ATTACTTTTG 
TGGACTGAGG 
C AATT AG AG C 
TAGGAGGTAG 



Figure 8 (Page 39 of 73) 



SUBSTITUTE SHEET (RULE 261 



WO 98/14466 



PCT/US97/17658 



55/162 



126361 

126421 

126481 

126541 

126601 

126661 

126721 

126781 

126841 

126901 

126961 

127021 

127081 

127141 

127201 

127261 

127321 

127381 

127441 

127501 

127561 

127621 

127681 

127741 

127801 

127861 

127921 

127981 

128041 

128101 

128161 

128221 

128281 

128341 

128401 

128461 

128521 

128581 

128641 

128701 

128761 

128821 

128881 

128941 

129001 

129061 

129121 

129181 

129241 

129301 

129361 

129421 

129481 

129541 



AATCAACAGA 
GAGCAAGTAG 
CAAAGGGCAT 
CAGGGAGAGG 
GAGTGTGTGA 
CTCTGACAAA 
TTTGTGGGCA 
GTGGCTTTGG 
AAAGGCCTAC 
AATTCTTC CC 
CAACAGGGAT 
AGGTCTGAGG 
AGGAGGAGAA 
GGAGAAGGAG 
GAGGAGGAGG 
GGCATGCTAA 
CATCAAGGCT 
AAGTTAGAAT 
AAAAGG C ACT 
GAGGAATACC 
TGTAACTGAC 
TTGGGAGAGG 
ATAGAATAAG 
GTTATCTAAG 
ATCCTTGATC 
TGACTTGCTC 
CCCTCTTGCA 
GAGAGAGAGG 
CTCCACAGTC 
ATTTAAAGGT 
AAGAATTTTA 
CAAACGTATC 
AGGAAAAGAT 
CACTGATGAT 
GATCTCCAAA 
ACCCATTCTG 
TCTAGTCTTC 
ATTGGAAAAG 
ACTGTTGCTG 
TTTGTAGATA 
TGGAATCCTA 
TTTACCAAAA 
ACATGTATCA 
AAAAATTAAA 
GGCTGTTATT 
GGGTATAGAG 
ATGCAGGTCC 
CTTAAGCCTA 
AGAACCAATC 
TGTGGATATA 
CTCAACGATG 
ATCGAGCTGT 
AGAAACAGTG 
GCAATAGGAA 



TTAGATGTAG 
GTAGACAGAA 
GAAGAGTAGA 
GAGGGAGATC 
TGTCCTGGAA 
AAGCTGGAGG 
TCAGAAAGCC 
CAGGGAGAGA 
TCTCAAGGGG 
ACTGCAGCCT 
CAGAG CTTCA 
AGGAGGAGAA 
GGAGGAGGAG 
GAGGAGGAGT 
AGGAGTTGTA 
TAAACTGAGG 
CCAACTGAAT 
GAGACAAGTA 
CTGGATTTAG 
AAGGGCAGAA 
TCTCCCACAG 
GAAAGCTTTT 
ACAGGAAGAG 
GGAAACAATG 
TGGTTTATTC 
TGTGCCCCAG 
ACAGCTGGAT 
TTAGTGTACT 
CCAGCTCTGG 
GTGTCTGTCA 
TGGGAATGGT 
AGAGATAGTA 
TCTTTGAAGA 
GATATGTCTG 
CTGAAGCCAA 
TGAGGATGTC 
CTCCAGTGCT 
ATAGAAGAGA 
ATCTTGGGCT 
TCCCTTCATT 
ATATGGCACT 
AAAAGACAAA 
AAATATCACA 
GAAAAGATGG 
TCCCCCATGA 
GGGCTCAAAG 
CATGGGGGCA 
ACACAAAGTG 
TGGCAAACTT 
AATTTATATT 
CTTCAGCCAT 
GTCTGTGTCT 
TTTGAAGTAA 
GGAACAGAGA 



GGATGAGAAG 
CATTTCTTCC 
AAGCCTGGGA 
TGCTAAAAAG 
GTTGAGACAG 
CCAACTGTGA 
CACCAGGTTC 
AGAATTATTA 
AAAACATTTT 
CATCTAGGCT 
TGAAAATAAA 
GGAGGAAGAG 
GAGGAGTTGT 
TGTATCATTA 
TCATTATAAA 
CTTAACACTT 
AACAATGAAT 
TTGTTATCTA 
AAATAGGAAG 
GCCTCACTAT 
TGTGGCTTTG 

TGTAGACACA 
GGATCAAGCT 
AGTGTTTGGT 
AAGCCCAGCT 
TTGGCCAGTG 
TATTCCCTGC 
CCTAGCTCTG 
GGGTATAACT 
TGTTAACTAG 
ATGACAGAAA 
GATCCCCAGA 
TAGATAGAGG 
CTGCTGTTAC 
AACAAACAAA 
TTCTATTGGT 
CTAAATCTTC 
GCCTCATATC 
ATATGCCCTT 
CCTCCATTTT 
AAACTGATCT 
TAGTACCCCA 
T AAAT AT AG C 
AGGGGGGAGT 
CTTTGAGAGA 
GAGCCTCTGC 
TGTGCTTAAT 
TGTAGGACTG 
GATCAAAAAA 
GCTCAACTCT 
GAATAATGAA 
TGGGTAATGG 
TCTGTGGTCC 



GGTCAAAGAT 
TGAAAGGGCA 
CAGATCCTGA 
ACTGCAAATG 
TATCACATTT 
ATGCCCATGA 
CTGCAGTGAA 
TGAAATACAC 
GCCAGAGTCT 
TTCTGTCTCA 
TTGGAAATGG 
GAGTTGTATC 
ATCATTATAA 
TAAACACTTG 
CACTTGTGAC 
TGACTACAGA 
TATGAATGAA 
GAGATGCCAA 
TCATTAGTGA 
AGTGTGTTGC 
GAAGAGAGAA 
TTTAATTGGA 
GGAAAGAGGG 
G C AAGTAT AT 
CCAAACCCAC 
TCTACAGATA 
ATCAGCCCAG 
ATCACCCCCC 
GTTACAGGTT 
GGGAGCTAGA 
TTATAAGAGG 
GCAACTACCA 
ACTGGGACCT 
CATGATGAGG 
TGGATTCAAC 
GTGGGAAATC 
AGGGTTTGGG 
ATAACCAGCA 
CCCTGTTCTT 
CATATATTCT 
TCAGGACCAA 
AATTCCTGAT 
TAAATATATA 
TCTGTCAGGC 
GAGGGAGCAG 
GGAGAATGTC 
TCATTCACCA 
AAGTATTTGC 
GTGGGCAATG 
TTCAAGGTTA 
TCTGTAGCCA 
AAGACCATGA 
AAGCATGCTA 
TATGTCCCCT 



GACACTAGGG 
GGTCAGATCA 
GATGACCAAT 
TCAGGATAGT 
GAGAACATTT 
GAGTGAGAAG 
GATCTGAGAA 
CCCAGAACCT 
TATCCCAGCT 
CTTAAGGGAA 
TGCAGCCAGG 
ATTATAAATA 
ACACTTGAGG 
AGGAAGAGGA 
GGTCCCAGCC 
ATGCTGCTTC 
AGAGCTGTAA 
GAAGGCAAGG 
CCTTGTAAAT 
ACCTGTCAGA 
GTCAGCAGCT 
AAAGACTGAG 
CAGACAAAAA 
AAACTTGTCT 
ATCCCTGTTC 
GCATTAGCTG 
CAGGAATGTA 
TGCTTGGTGG 
CCCTCCCATT 
AATTGCACTG 
ACTGAAAATG 
CCTCCAGGTT 
CTGAGGAGTG 
CTGATTTTAG 
TGCCACTGCC 
TTTTCACATC 
GAGGTGGCTA 
CAGGGTGACA 
CCCATTAGCC 
TTTGGTTTAA 
AAGAGTATAA 
TTGATCATTA 
CAACTGTGTC 
AGTGGAGGTT 
CTGAAAGTAG 
TGAAAGAGCT 
GTGCCTCTTC 
TGAGTATGTA 
AAGATCAGTC 
GGTGTTTTTC 
CAGAAAAAAG 
TGCAAGGGAG 
CCAGGGAAAG 
GAGCATATTC 



TTTTTAACTG 
TGTGTTGTCT 
ACCCATGGTG 
AGAAAATCAT 
AAATTGGTAA 
CTCCCACACT 
GGATCCTCTT 
TCTTCAAAAC 
GGGAGAAGGT 
GAAAATTAGT 
AAAGGAGCAA 
CTTGAGGAAG 
AAGAGGAGGA 
GGAGGAGAAG 
CCAAGATATA 
TCCCTAACAC 
GGAGAGACAA 
AAGATAACTA 
AATGGAGCCA 
GGTCAGGAGG 
GCATGGAGAT 
CTATGTGTAA 
CAAGTGCACA 
TGATAGAAGA 
TGCCTGTCTC 
GGCAGCCCTG 
GATGGCAAAG 
GCAGCTCTTC 
GCCTCTTCAG 
AAATTGAACA 
GAAAAGTGGA 
TAGGAGAACA 
TATGCTGGAC 
GAGCATGGAA 
AGGTTGAAGA 
CTTCCAGCCC 
GCAAAGCGGT 
CTGGATCACT 
CTGTCACAAC 
CTTTTTCTGT 
AAGATTATCT 
CACAATCTAT 
CATTAAAAAT 
TTACCACGAT 
GTGCTTATAG 
G C C AAAT AG C 
AATATCTACA 
AAGTGGAAAC 
AGGTAAAATC 
TTCAGTCATG 
TTTACCCATA 
TTGGAGACAC 
GAAAGAAGTG 
ACATGTTAAA 
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12 9601 GCTAATTCAG TTTTCAATCA TCATTAAAAT TTTGTTCCTA AATATATGGC CATTATTTTC 

12 9661 CACAACCACA CTAAAACTTT ATTACCTCTG GCAAGTGACT ATGCAAGTAA CTAAGAGCAA 

12 9721 AAATATCCAC AACTACCATT TGAGCTATCA ATTTAGGGAA AGTCATCTGG CTATAATCTA 

12 9781 AGTGACCCTC CACTGAATGT CAGTATCTTT GCATATGTGA TTTAAATCTG GGCCTTCGCA 

129841 ACACCATGAA CTGTTCTTGT CTTGAATATC CAGATTGAAG GAAATAATCT GAGTAGTTAC 

12 9901 GAGTCCTGAA GCTAGAAAGA TGGAAACCCC ATTTGCTCAT CAGAAAGCCT TAGAGCTTGG 
129961 GCGCTGGCGG GTCCTGTCTC ACCGGGACAG AGGGGCTCTT TCCTCCCCAT CTGATAGTCT 
130021 GATAACTAGA GAAGCCGGCC AACTTATTCT CCAAGAAGGA GCCATCTTAG TTCCTCCTGA 
130081 AATGTTCATA TTTAGAAATT ATTGTTTGTC AGTAATTTAA CCCCTTAATG GGCTTGCCTT 
130141 GTGGTCCATA CCACTGAGTG CAGAGCTTGC CTGGAAGAAT TGTGAGGGCC ATTCCATCTT 
130201 CCAGGCAGTA GAGTTCAGTA CTTCTTTAAA ATTGCTGCTG AACTCTGTAT TTGAAAAGAA 

13 0261 AGAATCATTT GGGTGTGGTA GCTCACACCT GTAATCCTAG CGCTTTGGGA GGCTGAGGTG 
13 0321 GGAGGATCAT TTGATGCCAG GAGGACCACT T GAG AC C AC C CTGGGTAACA TAGCAAGACC 
13 03 81 CTGTCTTTAG AAAAAAAAAA TACAATAAAA TAAATACAAT AAAAATAAAA GCAAAAAGAA 
13 0441 AGAGTCCATC TTAGGGACAG ACTGTAACTA CTCACTGGAG CTTACCTTTA CATAGTTCAG 
13 0501 GATCAATTAT AATAAAACAC TTTTGTGCAG ATTCAATAGG ATTATTTTAA TCCCCATCAT 
13 0561 CTCTCTGAGT TTCCAGTCAG TTTCTCTGCA TGT AG AC AC C CTTCTCCAGC CCACCATTGT 
13 0621 CTCTCCTCCT ATAGCTCCAC CAACAAATCA GAACTTTTTC TAACTGCACC TAGTGCACCT 
13 0681 AGAGTCTACT CCAGAATGCT CATGGAGAAA GTTTCTGAAA GGTAAAACTC TGAATGATAT 
13 0741 TTGTAG CTAA AGGGAGACTT GCTAGAGACA ATAAGCTAAT AGTTGTAGAC TTCAGTAGAA 
130801 GAGGAATGAC ACTGCAATGT CAGGGTGCAG G ACT T C AAG A GGGCAGAGTA TGGAAACCCA 
13 08 61 ATGGGAAAAA TGCTCACCAG GAACATGAAG AGAAGGAATT ACGTGTAAGG ATTTCTCAAT 
13 0921 GTGTTCCCAA ATTTG CCCAG CAGAGGGAGG CCTCGGGTTG ATGGCAGGCT GACCACACAA 
13 0981 TTAAAGAAGG CTGAACCTGG GGGCTTTTAA CAACCATCGT GGGCTCTACT GTAAGCATTT 
131041 AGAAAAAGAA AGTTATCCAT T C AAAAAT AT ATATATTTTT AAACTTCAGA ACAAAATTAT 
131101 GAAGAGCTAT ATTTACTTTT CTACATTCTA ATTTTTATAA ATCTGAGTAT ATTTTGCATA 
131161 TATTGTTATA GTACATATTC AATTTTGTAT TTTGCTGTTT TCACTTAACC ATTTTTACTA 
131221 GATTACTCTG TGTTCATAAT AATCACTTTT TTAAAACTTT TATTTTTATT TATTTATTTT 
1312 81 TTTTTTGAGT CAGAGTCACA CTCTGTCGCC CAGGCTGGAG TGCAGTGGCG TGATCTTGGC 
131341 TTACTGCAAC TTCCACCTCC TGGATTCAAG CAGTTCTCCT GCCTTAGCCT CCTGAGCAGC 
1314 01 TGGGATTACA GGTGTGCACC ACCAAGCCCG GCTAATTTTT GTATTTTTAG TAAAGACGGG 
131461 GTTTCACCAT GTTGGTCAGG CTGGTCTCCA ACTCCTGACC TCATGATCTG CCCACCTTGG 
131521 CCTCCCAAAG TGCTGGGATA ATCACTTTTT ATGCTGCATA ATTCTTCAGA TTTGTCAGTA 
131581 CGACTGTATT TACACTCATT TGTTTTATTA GAAAGAATTC CAGAATATTT TGGCTGCCCT 
131641 AATTAATTTT ACAATTAATA TGATTTTGAA ATTGGGTATT GGCTCCTTCT GAATTGGTTT 
1317 01 ATTAAAATAT ATTCTAATGT AATTTATGAC ATTTTCATCA T ATT AG CAT A TTTATTCTGT 
131761 TAGAATTTCA TAATTTATAA AGCTACAAAC TGTATGTGAT ATAGCTTGTA ACTTTATCTC 
131821 ATAACTTTAT GCAGTTACAA GTAGAAATAA AATGTTCCCC TCAAGATTGC TTAAAATTTT 
131881 ATTATAAACA AGTGTAAAAA ACAAAATCAC TAAAACACTC CCTCTTTTTT CCCCCAAAAT 
131941 GCATGTTTCC ATTTTAACAG AACCCGTATT TAATCAGCAG ATTTCTATGG TGGCTAGATT 
132001 TGTAGACTAA ATATTAAAAG TCCCAAAGCA AATGCATTTT TCTCTTAAAT TTTACTGACT 
13 2 061 TTTTTTTTTT TTCTTTTTCT GAGACGGAGT CTTGCTCTGT CGCCCAGGCT GGAATGCAGT 
132121 GGCACAATCT CGGCTCACTG CAACCTCCGC CTCCCGGATT CACGCCATTC TCCTGCCTCA 
132181 ACCTCCCGAG TAGCTGGGAC CACAGGCGCC CGCCACCACG CCCAG CTAAT TTTTTGTATT 
13 2241 TTTAGTAGAG ACAGGGTTTC ACCGTGTTAG CCGGGATGGT CTCGATCTCC TGACCTCATG 
13 2301 ATCTGCCCAC CTCAGCCTCC CAAAGTGCTA GGATCACAGG CATGAGCCAC CGCGCCCCGC 
132361 CTACTGACTT TTATCCAAAG AAAATATAAG AGCTCTTCAT CATAACGTAT GTTTCTTGCT 
132421 CTTGTTATTA AATATGACAC ATTTAGACTT AAACTGATTT GAAGGTTTAT GACATTGTTT 
132481 AAGTTATTAC ATAATTAATT CATAAAGATA ATGACTAGTT TGAACTACTG ACAGCTCACA 
132541 CATCATCAGT TGAACAGCAG AAAGCTTACT AAGCTACTTT CTTATGTTTC TGTCTCCCAG 
13 26 01 CTACTAAAAG AAACGAAACC CTTCCAGGTG TT AAGG C AAA ACTTTCCTCC CCCTTTCTTC 
132661 TATAAATCTG ATTCCATGTT AGTGAAATTT CTACTGATGG CTTTGGTTTC CTCTATAGTA 
132721 GAATAGAGAT CCTATGGCAA AAGTCATGTC TGACATGGTA GCAAATAGAA ATGGGGAAAA 
132781 GGAAGGTCTG C AAG AG C C AA TGTGGGAAAT GGGGAGAGGA CTGACTACAA AAACCCAGCA 
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138541 
138601 
138661 
138721 
138781 
138841 
138901 
138961 
139021 
139081 
139141 
139201 
139261 



CATGACCAAG 
TAGAAGAAGT 
AATCCATTTA 
TGGCTTGGCC 
AGGAGCCTGA 
TTATGGAAGA 
AGACACTGAC 
CAAATATTTA 
TCTCCCTTTC 
TATTAAGACA 
GTACAAGTCA 
AGGTAATTTT 
GTAG CAACAT 
GTTAAACACC 
TCTCATTGAA 
GATGATAAAG 
GTTCTATTTG 
AAAAGAGGAC 
TTCTAATTAA 
GCTGGGCGCA 
CACTTGAGGT 
AAAATACAAA 
CTGAGGCAAG 
CACTGCACTC 
AATAAATATT 
TGTCAATTTG 
TCTCCAAGTT 
TTTTATTAAA 
ACACAAAGAT 
ACTGGTTTTG 
ATGTTTCTTG 
CTTTATGCAA 
AAACTTCTTT 
AAAACAAAAC 
AAAAGTGAAA 
GAGAATCACT 
CCAGCCTGGG 
AAAAGAAAAA 
GGACTCGCAC 
AAGAGTAAAA 
CATGAAAAGA 
ATACTTCACA 
AGGGTGTGGA 
TTTGAAAAAA 
TTCTGGGTCA 
TCATACTGGC 
GTGAATGGGT 
CAGAACTTTG 
ATCCAACATG 
TGTAATCCCA 
GTTGCAGTGA 
ATCTCAAAAA 
AGGGAATAAA 
AAAAAATACA 



GTATTATGAG 
GGATTACAAG 
GAGAGACCTT 
ATCTTCAGCA 
AGAGAAAAAA 
ATACACATCA 
AAGTAGGACT 
ACAGCTTCAA 
CTGCCATGTC 
ATAATGAACC 
GATGATTCTG 
TGTTTTGGCA 
GGATGGAATT 
ACATGTTCTC 
GTAAAAAGTA 
AGAGATTCGT 
TACTACAGAA 
ATTGAATGTT 
TTACC CTGAT 
GTGGCTCACA 
CAGGAGTTAG 
AATCAGCCAG 
AGAATTGCTT 
CAGCCTGGGT 
TTTTAAAAAA 
AAACATAATT 
GATATACTTA 
ATGCTATAGT 
CAGTGAAACA 
ACAAAGGTGT 
AACAAGTAGA 
AAATTAACTC 
AAAAAATAGG 
AACAGGTTTA 
ATTTGCTTTT 
GGAACCCGGG 
TGACAAAGTG 
GAAAAATCAC 
CTGGAAAATA 
GTTTTCAACA 
TTTTAAACAT 
TTCAACAGAA 
GGAAACTACT 
GTTTGGCTGT 
TTTCTCCCAG 
TTTGTTTCAC 
AAATAAATTG 
GGAGGCCAAG 
GTGAAACCCC 
GCTACTCGGA 
GCCAAGACCA 
AAAAAAAAAT 
CTACTGATAT 
TATGATATAA 



ATTCTGGAAT 
TGGAACTTAG 
TCTCCAGTGG 
CATGGCTCCC 
AGG C ATGGAG 
CTTTTGCCCA 
TAACAAGAAT 
ACACAGGTGC 
ACAGGGGCCA 
AATACAACAT 
ATGATTATGA 
AATTTTTGTT 
GCAGGATACT 
ACTTATATGC 
CAACAGAGAT 
TAAAATAAGT 
TGGCAATAGT 
TCCAACACAA 
CTGATCACTA 
CCTGTAATCC 
AGACCAGTCT 
GCGTGGTGGC 
GAACCCAGGA 
AACAGAGCAA 
AGAACATCAC 
TTGAAAAATG 
AAAGGAAAAA 
AATCTGGAAA 
AAACAGAGAA 
CAAGGCTATT 
CATCCGGTGT 
AAAATAGACC 
AGAAAATCAT 
TAGTTTATAA 
CAAAAAACAT 
AGCTACAGGT 
AGACTCTTCC 
AGGCTGAGAG 
TAAGGAACCT 
GACATTTCAC 
CATTAGTTAC 
TAGCTAATGT 
CTCATATATT 
TTCTAACATA 
ATAAATGAAC 
AATGCTATAA 
TAATATATCG 
ATGTACGGAT 
ATCTCTACTA 
AGGCTGAGGC 
TGCCATTGCA 
TGCAATATAT 
ATACACAAAA 
ATTCCATTCA 



TTCCCCAAAC 
AAGGGGAGTA 
TGACTCAAAG 
AAGGATGTCC 
TATTGTGAGT 
CCTTCTACTA 
CTAATTTTGA 
ATTGCTATCA 
GCATTTATGT 
CTTGAGCATA 
ATTATGTCAA 
TGTTCATGAC 
ACATTAAGTG 
AGAAGC TAG C 
TACTAGAGGC 
TACAGCTAGA 
TAACAGTAAT 
AGAAATGAGA 
TACACAGTAT 
CAGCACTTTG 
GGCCAACATA 
ATGTG CCTGT 
GGCGGAGGTT 
GGCTCTGTTT 
TATGCACCCC 
AAAAAATGAA 
AGTCCGAGGG 
GTATTTCAGA 
CCCAG AAATA 
TAATAAGTAA 
GGGGGAGAGG 
ATAGACTTAA 
CAACACCCTA 
AACATAAATA 
TATAAAATGA 
TGCAGTGAGC 
TAAAAAATAA 
AAAATATTTA 
TATAACTTAG 
AAAAGAAAAC 
TAGGGAAATG 
TAAAAGGACT 
GTGAATGTAA 
AAATTAAACA 
ACATGTCCAT 
ACTGGAAACA 
GCCAGACGCA 
CACCTGAGAT 
AAAAATTAGC 
AAGAGAATCA 
CTTCAGCCTG 
CTATATCTTG 
TGGATGAATC 
TATGAAATTT 



CACATTGATT 
TTCGAGAAGA 
ATGCAGCTCC 
TCAGGATGGT 
GGTAGGTGGT 
ACCAGAACTC 
GTCTAGGAAT 
CTATGCTTGG 
CTAGATTGGG 
AAACCAACTG 
TAAAAGAAAT 
AGGATGAAAT 
AAATAAGCCA 
TAACTAAGTA 
TGGGAATGGT 
TAAGAGCAAT 
AAATAATTTC 
AATGCTTGAA 
GTATAAAAAT 
GGAGGCCAAG 
GTGAAACTCC 
AATCCCAGCT 
GCAGTGAGCC 
CAAAAATAAA 
ATATATACAT 
ACACAAATAT 
CTTAAACTAT 
ATGAATTGGT 
GATTCACACA 
AAAAATCGTC 
AGCAGGAGCC 
ATGTAAAAG C 
GGATTAGCAA 
ACAAAATGAT 
AAAGCAGGAG 
CAAGATGGTG 
ATAAATAAAT 
TAATACATGT 
TAAGATGACA 
ATACAAATGG 
CAAGTCAAAA 
GACAATCCCC 
GAGGACAATG 
CTTATACAGC 
ACTATGACAT 
ACCCACGTGT 
GTGGTTCATG 
CAGGAGTTTG 
TGGGCATGGT 
CTTGAACCGA 
GGCAACAAGA 
GAATATTATA 
TCAAAAATGT 
TAGGAATGGG 



GCTGGGAGAA 
CGTCTCTGCA 
TTTCATCCTG 
CTCTAATCCA 
TATGGACCAG 
ACACAGCCAT 
ACGACTGTAG 
CCCAGGCCTG 
TTGGTTGGGA 
ATACAATGAT 
GTGATAACTA 
CCTGTCATTT 
GAAACAGAAA 
AATAAGTTTA 
AGGGGAAAGA 
CAGTTCTAGT 
AAAGAGCTAG 
ATAATGGATA 
AACACTATGG 
GTAAG CAGAT 
ATCCCTACTA 
ACTCAGGAGG 
GAAATCGCGC 
TAAATACATA 
ATAATTATTA 
GAATCAATCC 
TCAATCAAAA 
ATAAGGTTAG 
TCTATGGACA 
TTTTCAGTAA 
TTACCTCAAA 
TAAAATTATA 
AGATTTCTTT 
AAATTTCATC 
GCTGAGGCAT 
CCACTGCACT 
AAATAAATAG 
ATCTGACAAA 
AGCCAAAACA 
CCAGTATGCA 
CCACAATGAG 
AGGGTGAGCA 
TTACAACTAC 
CCAGCAATAT 
GTACAAATGT 
CCATCAACAG 
CCTGTAATCC 
AGACCAGCCC 
CACGGGCGCC 
AGAGGCGGAG 
TGGAAACTCC 
AAGCAATAAA 
GAAGGAAAAT 
AAAACTAAGC 
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TGTAATTATG 
AGGTGATACT 
GTAGTAACTG 
ACTGTGTATA 
ATGAGGAGAG 
TCATTATATA 
AACCTTCTAA 
CCCTTGACCC 
ATCTATCAGA 
AATACTTCAA 
ATTACATACT 
CATCCCACTC 
CCATCAGCTC 
TGTAAGGACA 
GCTGATGGCG 
CAAGAGGAGA 
TAGAAACTAC 
GGTCTATTCA 
AG CTATACC C 
GTTTTAAAAT 
GCAAAAACAA 
TCTGGGAGGC 
CATAGTGAAA 
TTTAATCCCA 
GTTGCAGTGA 
TCTCAAAAAA 
TCGTTTAAAT 
AATAAAAAGA 
GGGACGCAAG 
CTGAAGTAAA 
CCATTACTTC 
TTCAAGGCTA 
GTCCCGCCCC 
CTTAACGCAC 
CTCCTTTCCC 
CTGTGGCTCT 
AGGCTCACAC 
GGGAGGCGCG 
CAGGGCCGGC 
GCGCCCTGGC 
GGCGGCCTCC 
TGGCCTCAGG 
TGAATTCAGT 
GTTTTTACGC 
CTTTCAGAAC 
CTTAAATTTC 
GCCTTCTGAC 
CGAGTCGTGC 
TGCAGATCCA 
GACGGGCCTT 
AAGTCATTCT 
ACCTCGTGAC 
TCTGGCTGGT 
TGTATTATCT 



GAAAGTACAT 
ACAAATGGAA 
TTTGGGTATA 
AATTATTCCT 
ACAACTATTA 
TGGAAATTAA 
ATTTTATCTT 
TTTCTCATGT 
TAAACGTATT 
TATCTAAAGA 
CTCCAGGGAA 
CAAGCCCTCC 
TGGATATCTG 
GCACTGCCTG 
TGTTTGAGGA 
GCCTTGATGC 
TATGAACACA 
ATTACCTGGG 
CATATCATAC 
TTTAGAAGAA 
AAATGATTAA 
CGAGGTAGGT 
CCCCATCTCT 
GCTACTTGGG 
GGGGAGATGG 
ATCTAAAAAT 
GATATGACAC 
GGCGCTGAGA 
CGCGTTCTTC 
CACAGTCACA 
AATTTTTGTA 
GGAGGGGTGG 
CATTGCCCAG 
CTCGTTTTTC 
GGAGCTCTCG 
CCAGAGGGCG 
CAGGCCTGCG 
CAGGGCGACT 
GCTGCGGGCG 
TGGGAGCCGG 
TCTCCGCTCC 
CTGAGACCCC 
TTTATGTATA 
ACAAAACATG 
TTTAACCTGT 
ACCACAGCCT 
CACGAGTCCT 
AGCCCGCCCT 
GGCGGCAGCC 
GAACGCTCCC 
TGGGGCCCCT 
ATCCCTGGAA 
CCCTCCTCAC 
GGTTATTTAA 



CAGTGGCTGC 
ACTATCTAGG 
TAAAACG CAT 
CAATAAAGTT 
TCTTAAAATA 
AGGAGAAAAA 
AGCTCATAGT 
TCCATCTTTC 
CATATGAAGA 
AGGTACAAAC 
GACCTGAAGA 
CCTCCAATCC 
TACTGATTGA 
AATC CTATTT 
TCCAGAGGGG 
AAAAATAGGT 
ATAGAGCAGA 
CTGAGAAACT 
ACAAAAGTTT 
AATAGTGCAG 
AGTGGCCAGG 
GGATTAGTGG 
ACTAAAATAC 
AGTCTGAGGC 
CGCCACTGCA 
AAAAAGATTA 
ACTACATCTA 
GCATACAACC 
CTTCATTTTA 
CGTTAACCTT 
CTTTTGCATT 
CTCTCTCTTA 
TCCCCACCCT 
GTCCAGTGGA 
CTCTTAGAGG 
CGGCATTCAG 
GGAGGCGGCC 
GCAAGCTGTA 
GGGCTCCTGC 
GGAGCATTTG 
CCGCCCGCCA 
AACGAATCAT 
ACTGAATTAC 
AGACACAAAT 
TTGCTGAAGT 
CAAAGAGGCC 
GCGGCTATGA 
GCTCCAGCTG 
CAAGATTTGG 
AGGACCCACA 
GGGGGCGGGC 
GAGAAGTACG 
CTACATACTT 
ATATCTGGTT 



CTGGGGCCAA 
TTGACGGAAG 
CAAATTGTAT 
GATTTTTCAT 
GTTAAGCACA 
TAGTG GTATG 
TGTAAAAGCT 
TGTTTGTTTC 
TTTAAATATA 
AAAACAAAAA 
CTAGCCCCTT 
CATATGCACT 
TGCTCCTGCT 
ATCTCTCGCT 
TCTTTGGTTG 
GAAGAAATCA 
AGTAGCCTTA 
GGCTTTCATA 
CTACATCTAA 
AATTTTAGTG 
CACGGTGGCT 
AGGTCATGAT 
AAAAATTGGT 
AGGAGAATCA 
CTCCAGCCTG 
TTTTTAAAAG 
ATATTTGGAA 
TATCCTCAGA 
ACTGGTCATT 
TAAAAATCTA 
TTAAAATATC 
AGCCCTGTCC 
CACTTCCCCA 
CTCAGACCTG 
AAATTGAGAG 
ACCCCAGATC 
CACCAGAGGC 
GGGGGCGCTG 
GGCGTGAGGG 
CTGCTCTGCT 
TCCCCCAACT 
TCCCCGCATG 
GGATATGAGA 
CTGTAAGAAA 
ACGTCAGTAA 
ATTTCGTGGA 
AAGAGGAAGC 
GGGACACCGG 
TCCAGCCGCC 
TCTGGAGAGG 
ATGGACCTGG 
TTCAGTGTCA 
TTCTAATTTG 
ATTTAAAAGC 



GAGGATGGAA 
TGTTCTGTAA 
AATTAATACA 
TAAATATATT 
ATAAAAATAC 
ATTAATTAAA 
GCCATCCCTA 
ATAGTTTATG 
TTACATGTTA 
TCAACACTTA 
TCTGGATCCC 
GGGCATTCAT 
AACTACCTGA 
ATGCCATAGC 
GCAGGATTGT 
G T AC AAC AAA 
AGAGTTGGTG 
TGGAATAAAA 
CAAAGACACA 
CAGAATTTCT 
TATGCCTGTA 
TTCGAGACCA 
AGGGTGTGGT 
CTTGAACCTG 
AGCAACACAG 
ACTATTTTAA 
AAGTACTTCT 
AGAGTGTTTG 
TTCATTTATT 
GGAGGTGCGT 
ACAGGGAAGC 
CCGCCAGCCC 
TTTCCCCACT 
TAGTCTTCCA 
AAGCATCAGC 
CAGCTGTGAG 
GCTAGAAAAC 
GCGCCCTCAC 
GCGGCCCCAG 
GGACCCTGAG 
CCCGATCTCT 
GGAACATTTT 
ATCTCAAATG 
TATAAAGTCG 
CAATGGCAGG 
TCCGCTGAGG 
CGCGGTTCAG 
TGGTCACGGC 
AAGGGGTGGC 
GAGGTGGGGG 
GTAAGG CCAG 
CTCCAGAGCT 
TCTGGAGCAG 
TCTCCATTAA 



GAGGCGGCAC 
CTTGATTACA 
GGTGTATTTT 
ATTTGCTAAA 
TACAATCAAC 
ATAAAAAGAA 
ACCAAGGCCA 
TCTCACCAAA 
AGCCTTAGCG 
GTTATAAGAG 
ACTAGCCCCT 
ACAAATAAGA 
ATGATTGCGA 
GGCCTTCCAT 
TTTATTTCCC 
ACAGAAAGCC 
GAGAAAGGAT 
ATAAAATTAT 
GATAGAAAAT 
TAGACTAGAT 
ATCTCAGCAC 
GCCTGGACAA 
GGCTCACGCT 
GG AG G C AG AG 
CGAGACTCTG 
ACAAAAAAAA 
TAATACTTTT 
ACCTCTAGGA 
TCAGGAACAT 
ACGCATAGTT 
TCGGTACAGC 
CAGACCTCTC 
CCCGCGGTCT 
CCAGGATCGG 
GG AGACC C AT 
AACGGACCCC 
AAGCCTCGCG 
AGGCCAGGGG 
GCCAGCAGCT 
TCTGGCGGCG 
CTGCTGCGTC 
ATGATATAAC 
AGGACGAATG 
TGACCACGTC 
GAAAGGGTAT 
CTTGGAGTCG 
GGCGTCCTCG 
GCTTTCCAGC 
TCGAGTGACT 
TGGGGTGCTG 
AGAAATTGAC 
GAAACCGCCT 
GCCGGGCATC 
ATTCACATAC 
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ACGAAAATAA 
GTCCTACACG 
TCTGAAGTAA 
TGGCCCCAGA 
ACTGGGAAAA 
AGAACCAGGG 
GTCACTGGTC 
AAAGGTGAAA 
CACATTAAAG 
GAGTTTAGGT 
AGCGCTGAAG 
CGGAATATGA 
ATTACTCTCC 
GTGATAATTT 
ATAAAATTTT 
GATAACTGTG 
AATTATTTAA 
ATATTTATTA 
CTCAAGCCTG 
AGTTCAAGAC 
TGTGTGGTGG 
TGAGCCTAGG 
CAACAGAGCA 
CATACTACTG 
AGATAAAAAG 
ATATC CTGTA 
GGCCAAGGAG 
AGACC CTGTA 
TTGAAAGCCA 
CAAAAATGAA 
ACTATCTAAT 
TTTAAAAAAA 
GAAGATTTTG 
TTAAGTTACC 
TAATCCCAGT 
CAGCCTGGGC 
ATACAAGACC 
TTATTTAGCT 
TTCTTTTCTC 
CAGGTAATGT 
ATACTTAAAA 
TCTGTTCACC 
TGTCATTAGG 
CAAAATTTAG 
TCATTACTGG 
ACAAAAGTAA 
AAGGAATTGA 
TTCATAATTT 
GGTACTTCTG 
GCCGAAGTGG 
AACCCCAATT 
GATATGAGAA 
TCAGCCTGGG 
AGAAAAAAGA 



AAATTAAAAA 
ATAGTGAATA 
TAGACGCCCA 
ATTCTTTAGC 
CCTCATTTAG 
ACTATTGCCC 
CACTTTCACC 
ACGTTTCTTT 
CAAGTGCTCA 
CTTGGAGAAA 
ACTGAGGACC 
CAGTTTGGAG 
T AAC CAG AAA 
AATGTGTTTT 
AAGAATTAAA 
AATTATAATT 
TAAAACCTAT 
TTATAAATTA 
TAATCCCAAC 
CAGCCTGGGC 
TGCGTGCCTG 
CAGTCAAGGC 
AGACCCTGTG 
ATACAGAATT 
GAGAGTGGAA 
GCAGAACAAA 
GGAGGATTGT 
TCTAAAAAGG 
TTTTCTGCAA 
TAGATATTAG 
AGTATCTAAG 
G AAAAC CAT A 
TTAGAAAACT 
AAACAGTTGC 
ACTTTGGAAG 
ACTGTAGCAA 
TTGCTAACAA 
TTAGAGTACT 
AAAGGACTTG 
TTGCAAAAGA 
CTCAACAGTA 
AAAGAAGATA 
GAACTGCAAA 
AACACTGTCA 
TGAGAATGCA 
CCATACTTTT 
AAACTTATCT 
ATCCAAAACT 
AATAATGGAA 
GTGGATTGCT 
AGCCGGGCAT 
TCGTTTGAAC 
CAACAGAGCA 
AAAAGAAAAA 



AAATTTTAAA 
AATTTTTTTG 
GAGAAGGGAT 
AGACCGTGTA 
TATGTTACAT 
CAATATTATA 
CAGTTGGTAA 
TATAATTTCA 
GGAGTGACAT 
AGAGACCCAA 
CTGCCTGTGG 
GGGCCTGAAG 
AGGTATTTCA 
AAAAAATGTA 
AAATATAAAA 
AGTAATTAAA 
TTAACATTTA 
TTTTAGAATT 
ACTTTGGGAG 
AACATGGAGA 
TAGTCCCAGC 
TGCAGTGAGC 
TCAATATACA 
GAGTAGAGAT 
GAAGGTATGT 
ACAACAAAAC 
TTGAGCCCAG 
AAGAAAGAAA 
ATACATAGTG 
TTGCCTGAAA 
CTAGTAAATT 
TTTATAAGAA 
ATGAGAAAAA 
TAAAGAATAC 
GCCAAGGCAG 
GACCCGTCTC 
TAGCAAAGAT 
CTCGTGATAT 
CAAATTTACA 
CAGATCTGAT 
AGAAAATAAC 
CACAGATGCA 
TTAAAACAAG 
GCACCAAAGG 
AAATGTGCAA 
ACCATAAGAT 
CCACACAAAA 
TGGAAACAAG 
TGTTATTTAG 
TGAGGCCAGG 
AGTGG CGTGA 
CTGGGAGATG 
AGACTCCTCT 
GAAAAGAAAC 



AAAAAGAAAC 
TGTGGTCCCT 
CGACTTACCC 
TATGAACGTC 
GCCTAGCGTT 
TTTCAGGAAA 
ATGAAACCAG 
CATACAATCT 
CAAGATGTAA 
GGAGACACAA 
ACTGAAGTGA 
GACTCTTCTA 
ATTTATATTT 
TCACAGTGAT 
ATCTTTTATA 
TTGAAATACT 
ATATTTATCA 
AAAAATAAGT 
GCTAAGGTGG 
AAC CCTGTCT 
CATTCTGGAG 
CCTGATCTTG 
TATGGACAAA 
GCAAAG CTAG 
CATGAATTTC 
TGTAGATAAA 
AAGTTGGAGA 
AAAAAAAAAA 
AATT TGATC A 
TAAAAATCAA 
TGGCCAGTTA 
GAGGTGATAA 
AACTATTTTT 
CAGATGGCTG 
GAGGATCATT 
TATTAAAAAA 
CAATTAATTC 
GAGATTGCCA 
AAGAAGTGTT 
GAAGAACAAT 
CTGATTTAAA 
AGTATGCATA 
TAGATACCAC 
TTGCAAAGAT 
TCACTTTGGA 
TCACCAATCA 
ACCTGCACAT 
ATGTCTTTCA 
AGTTAAAAAG 
AGTTTGAGAC 
GCCTGTAATC 
G AGGTTG CAG 
GTCTCAAAAA 
GATCAAGCCA 



AAAAGCTCTC 
AAAATTGAGT 
ATCATGC C AC 
CTTTG CAATC 
TTGTGCCTGA 
GGAAGGCCCA 
AAATTATAGC 
TTAATGGACC 
AAAATAGTCC 
GACAAAGGGG 
GGATGGGGAC 
TTCTCTATCA 
TCCATCACAG 
GGCCTGGTGT 
TAGACATTAG 
GATTATTTTC 
GTAATTAAAT 
GTAGAAGCGA 
GAGGATTGCT 
CAATACAAAA 
GCTGAGGTGG 
CCACTGCACT 
CTTAAAATTT 
TCCTATAACC 
ATGATAAATG 
ACATATCCAA 
CCAGCCTGGG 
GGATGATAAA 
GTAATTTTCT 
ATATCCAACA 
TAAAATGTCT 
AGAGAAATTA 
TGTTTTCAAA 
AGCGTGGTGA 
TTAGGCCTGG 
AAAAAAAAAA 
AAAATTTGAA 
AATTAATACT 
GAAGAAAAGC 
ATTTTTAGAA 
GCAGGCCAAT 
TGAAAAGATG 
TGCATACCTA 
ATGTAGCAAT 
AGACAGTTTG 
CACTCCTTAG 
AGATGTTTAT 
GTAGGTAAGT 
AAATGCATTC 
CAGCCTGGTC 
CCAGCTACTC 
TGAGCCAGTG 
AAAAAAAAAA 
TGAAAACACA 



TAATGACCAA 
TCATGCCTTT 
AGAGATTAAT 
ATATAAATTA 
ACACCTTACA 
GACAAATGGT 
TGTACCACAG 
CAGTGTCCAA 
TGTCCTCAGG 
AAAGAGAAGG 
ACCCGATGCC 
GAAAAACAGA 
CACTTTTCTG 
GAAATAAATA 
GAGTTACAAG 
ATTTTTATTT 
CTAATTGTTA 
GGCATGGTGG 
TGAGCCCAGT 
AAATG AG CCA 
GAGGATGACT 
CCAGTCTGGG 
AAAATGAAAG 
AGAACAATAA 
GCAATTGCAA 
CCCTTTGGAA 
CAACATAGTG 
GTAGACAATA 
TCCAACAGTG 
AAAAATATTG 
TAAATTTTTA 
TTTCAGTTAT 
AAGTGAAAGA 
CTTATGCCTG 
AGTTCGAGAC 
AAAAAAAAGA 
AAACTGTAAT 
TTGGGTGCAT 
CACACATTGG 
TATACAAAGA 
GACCTGAACA 
CTTGACATCA 
GTAGAATGAC 
AGTAACTTGT 
GTGGTTTCTT 
TATTTATCCA 
AGCAGCTTTA 
GGATAACTGT 
ACTTTGGGAG 
AACATGGGAA 
GGGAGGCTGA 
CCACTGCACT 
AAGAAAGAAA 
TGAAGGAAAC 
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14 5801 TTAAATGTAT GTTACTAAAA AGCCAACCTG AAAAGACTGC ATACTATATG ACTCCAACTG 

14 5861 ATGCAGGGCA AGCAAGCCAA AAATTAGGGC TTAGCCCGGG AAGAATTCAA GGGTGAAGTG 

14 5921 GTGGTGTTAG CAACTTTTAC TGAAGC AG C A GTGTACAACA GCAGAACAGG TACTGCTCCT 

145981 TGCTGAGCAG GGCTAACCCA TAAGTAATGT GCC CAGAGTA GCAGCTCAGG GGCAGTTCTG 

14 6041 CAGTAATATA CCTGCTTTTA GTTAAGTGCA TGTTAAGGGG GATTATGCAG AAATTTCTAG 

146101 AAAAAGAGTG GTAACTTCGG AGTAGGTACA GAGGAAAGAA GTCGATAATG TCCTGTTGTT 

146161 GCCATGGCAA CGAAAAACTG ACATGGCGCT GGTGGGCGTG TCTTATGGAG AGGTGCTTTA 

14 6221 ACCTCGTCCC TGTTTCGGCT AGTCTTCAAT CTGGTCCGGA GTAAAGTCCC TGCCTCCGGA 

1462 81 GTTCACTCCT GCTTCCTGCT TCACAACTGT ATGACACTCT AGAAAAGACA GTAACTATGG 

146341 ACACAGTCAA AAGATTAGTT GATAGAAATT GGGTGACAGG AAGTGTTGAA AAGGCAGAAC 

1464 01 ACAGGATTTT TAGGGCAGTG AAACTTCTGT GATACTATAA TGGTGAATAC ATGACATTAT 

14 6461 ACATTTGTCA AAACCCATAG AAAGCACAAC ACCAAGAATA AACCCTAATG TAAATTACAG 

14 6521 ACTTTCGTTG ATAATGACGT GTCAATGTAA GTTCAATTGT AATAAATGTA CTACTGTGGT 

14 6581 GCTGGATGTC TATGGTGGGG GGACATTTTT GCTTCAATAG TTACAGTTGA AGTAAATGTT 

14 6641 TGTGTTTCCC ACAATGCATA TGTAGAAACT CTCACATTCA ATGTGATGGT CTTTGGAGGT 

14 6701 GGGCTCTTTG GGTGATAGTT AGGTTTAGTT GAGATCCTAG CAGATCGAGT CTTCATGATG 

14 6761 GGCATGATGG GACTGGTCCC TTATAAGAAA AGACCAGAAA GCTAGCTCTC TCTTTGCCAT 

146821 GTGAAGACAT AGCAGGAAGG TAGCCATCTG CAAGCTAGGA AAGGGCCTTC ACAAAGAATC 

146881 AACTCAGACC TCAGAACAGT GAGAGATAAA TTGTCGTTGT TTAAGTCACT CAGGCTGTGG 

146941 TATTTTGTTT CAGCAGCCCA ACCTAAGACT G TT AATTGG A TTAGAAATTT CCTTTTGGGG 

14 7001 ATGGTGTGTG GCGGGCGGGG GGCGGGGAGT ACCTTTGTTA AGCTTTTATA TCAATGAGTT 

14 7061 TGTAGGCTTT TCTTTTTTGG TCATTGACTA GGACAGTTTA AATAGTATGA GTGTGAAGGA 

14 7121 GATTGTTGGT CATCTATTCG ATGTCCCTTC TCTGTTTTTT AATATGAGAA CTCCTGATTT 

14 7181 TCAGCCAACT ACCCTGGAAA AAAAG CTAAT CTTTCTGACT TCTTAAGTGT GGCCATGTAC 

14 7241 TAAATTCTGG CTAATGCAAG GCAAGCCAAA GGTTTTATGA TAGGTTTTAG GACACTAGAG 

14 73 01 TAAAAGAGAG CTGTTGCACA CATGCTCTTC AC C CTACTTT TGTGTCCTTT TTTCCATCCT 

14 7361 ACAACTTGGG TTGTGAGTAT GATGGCTGGA ACTTTAGTGG CTCTCTTGGA TCCCAGGGGT 

14 7421 AATTGAGGGG TGGCTGGAAG GAATCTGTGA TTTTCTGGAG TTTCCATACA CAAACAAGAC 

14 74 81 CTGGATTTTC TGGGCTTCCC AGACTTCCAC ATCTAGACTT GCTTTAAATG GGAGATAAAT 

14 7541 AAACTTGTTT CAGCCACTGT CATTTTGGGC TATTTTATAG AACTTAATCT AATCTTCAAG 

147601 GGTACATGAA TTGCTTTTCC TTAAAAAAAA AATCAGCCAT AAAATCATCT TCTTTTTTCT 

14 7661 TTTGTTCCCC ACATTATTTA GTTGGAGCTC TGTAACTTTT TTTTTTTTTT TTTTTGAGAC 

14 7721 AAGGTCTTGC TCTGTCACTT AGGCTGGAAT TCAGTGGCAT GACCATGGCT CACTGCAGCC 

14 77 81 TTGCCCTCCT AGGCTCAAGC AATCCTCGTC TCAGCCTCCT GAGTAGCTGA AACTAAGGCA 

147841 CATGCCACCA TGCCCAGCTA ATTTCTTTTC TTTTAGAGAT GGGAGCCTTG CCCAGGCTAG 

14 7901 TCTCAAACTC CTAGCCTCAA GTGATCCTCC CATCTCAGCC TCCCAAAGTG ACAGGATTAC 

14 7961 AGGTGTGAGC CACCATGCCT GGCTGCTCTG TAAGTGTCTG AATTTCATTT TGTATTTATC 

14 8 021 AGTCTGTTTA GATTTTCTTT CCCTTCTTGG GTCAGTTAGG CCATTGGTTT CTTTTTAAAG 

14 8081 GTTTTCAAAT TTATTTGCAT CTAATTCTTC AAATTACTCT CAAAATTATT CCAGTATATA 

148141 TTCTTTTGTT CCTATTTTCT TCTGTATTCT TTATTAAAAT AGCTAATGAT TTATCTAGCA 

14 8201 GGACTTATAT TCTTTCCATA ACTTTCCTGC ACCCCAATTA ATCTCCAATT TTATATTTCT 

14 82 61 TCTGGCCTTC CTTATAGTTT CCACAGGTTT ATTTTATTCA TTTTTTAAAA CTTTTATTTA 

14 8321 ATTGTTTATT TTATTATCAT TCTTTCTTAT TCAGCAATCT AAGTGCTTAG GGATATAGAA 

14 8381 TTTCCTCTAA GCAGCATATG CTAGGCTTTA ACAATGTTAG GGAGGCCTCC CCTTTCTGGG 

14 8441 GAAGACCACA CTTACATTAA CACAGGACTG TGGGATGCCA AGAGGTAGAG AAGAGCTTAT 

148501 GAATATCCAG ATTACATCTT CACTGATCCT GCACAAAGGT GGGGTTCCTC GGTTACCCAC 

14 8561 TGGGTCCTAT TACCCAAGTC TGGGTCAGCA TACCGAGACT ACGGGTATAT AGAACAAGTG 

14 8621 CAACTGGCGA TAATCCTTCT GTTGGGGAGA AAAATCTTTT TTTTCTATTC ATCTTAGGTT 

14 8681 CTCCATCTGT GGCCCTATCA AGTAGACTAA CAAAAGACAG ATTGACAAGA CAGAAACAAA 

14 8741 GCATGTGCAT TGTACAAACA CAGGGGAGTA CTGAGATGAA TACTCAAAAG AGGATTTAGA 

14 8801 ACTTGGGCTT ATATAGCATT TTAAGAAAAG AATACATTTT TTAAGTGACA AGGAAGACGA 

14 8861 AAAGGACTTT GAGTTTCTAG TGCAGTAAAT TGTGGGAAGG CAACTTTTTC TTTCCCTTTT 

148921 TTTTTTTTTT TTTTTAAAAA AAAAGACTTC TCTGGTGCTA TGTCCAGGCT GATAAGAGTC 

14 8981 TAAAGTCTCT GGTGACTAAC TTTTGTTCTT CCCCGAGTAA GAAGACACCT TCACAATTTC 
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ATATCCTGCT 

TTTTTTCTCA 

ACCCTTCACT 

GGGAACATGG 

GCCATGCCTC 

GTTGTAGGAC 

TGATCTTGGC 

GTAGCTGGGT 

CAGGGTTTCA 

CCTCAGCCTC 

TTATGTCAAC 

TGTTGTTTAT 

TTTTCAAAAT 

AGCCATTTCA 

GCAGACTGGA 

ACTCATAATT 

ACACTTTGCT 

CGCTTAGATT 

TGTTTTGTTT 

TGAGTTCAAA 

CTACTCATCA 

ACAGCAACAT 

GGTATTTTTA 

CTCAGCACTA 

CTTATGGGAT 

CTTAGTCACC 

GGAGAGTATC 

TATAAAAGGA 

CATGCAGATG 

GCCTGAGGGT 

TTTCCATACA 

GTCTTCCTGA 

AAGATGTGTC 

GAATTAGTTC 

AAACAAACAA 

TTGAGATGGA 

CAACCTCCAC 

TACAGGTGCG 

CGCCATGTTG 

TCCCAAAGTG 

TAAAACCAGC 

AATCATTTAA 

TGCAGAGAAT 

CTGGTACATG 

GCCGTGGGGG 

CATAAAGTGG 

GAGCCCACAC 

ACAGGTGAGG 

ACAGTTTAGC 

ACACGTTTAT 

GGGGTTTTAT 

CTTAGATTTG 

AAGGCCACGA 

TGACAGACAA 



TTTAGGCAAA 

ATTGTCTTCA 

TACTACTTAC 

GCAATAGTTT 

TGACATCTAG 

TTCAACAAAT 

TTACTGCAGC 

TACAGGCCCC 

CCATGTTGGC 

ACATAGTTCT 

TCTAAATTAT 

GTTTTAGTTG 

GAAGTTAAGG 

GCAATAACTA 

AAATACCAAA 

CCCTTTTCAT 

TGGCTGTTTC 

ATTAAACAAC 

TGTTTTAAAT 

TGGCAGCAAA 

CCTGAAGACC 

ACGATCATCA 

GTTGCTAAGT 

GTCTAACTAC 

TTATATTATT 

TGGGGGAGGC 

AGAGAAGGAA 

TACAACTCCG 

AAGGGAAGTT 

CCAGAATCCA 

AACTCCTAAA 

GGTGAAGCCT 

CTGATTCTCC 

GTGATGAGCT 

AAATAATTTT 

GTGTTGCTGT 

CTCCTGGATT 

CACCACCACA 

GCCAGGCTGG 

CTGGATTACA 

CTGTGTTCAA 

CTTTCTGAGC 

GGTGGGTAGG 

GTAACCACCT 

CTACAAAGTC 

GTCCAGGGTG 

TATTTATTGG 

G C ATGAGG AC 

ATTTTCTTTG 

GAGTGAAAAG 

GCCCTGAGCC 

TGGTGCGGCA 

GGGGTTTTGG 

GCCAGTCCTG 



TAGGGAGAGG 

ACTCAAAATA 

AACCCAGCCT 

GAACTCTTTT 

ACACAACTGT 

ATCAGTAAAC 

TGCTGCAGGC 

TACC AC CATG 

CAGGCTGGTG 

GGGATTACAG 

AACATTTAGC 

TAGTCCTGTC 

TCTATTTGCT 

TTTACTGAGA 

TTCTTTTCCA 

TTGAAGCATC 

TCAGGTAGAA 

ATGTCAGTGG 

GCAGTTGGCG 

CAAACTAGGA 

CTTGGAAATC 

CTATTATCTT 

CCATATATTC 

TAAGGAAATG 

TTCTCTGTGC 

AGAAACAAAG 

GCCTTCGGGA 

GCCTCATAGT 

GGGGGAGAAG 

GACTGAGGCT 

TCCCATCCGG 

TCACAACCCA 

TGGGTGTATC 

G T ATCTGG AT 

GTTGCTGTGA 

CACCCAGGCT 

CAGGCAATTC 

AGTGGCTAAT 

TCTCAAACTC 

CAGGTGTGAG 

ACCCAACTAT 

CTCAGTTTGT 

ATTGAATAAG 

AATAAGTGGT 

CGGCGGGGTC 

CCAGCACTAG 

TGATCAAACA 

ATGGGGGTAG 

ACACATGTAG 

CAAGGAACCA 

CTGGGTTCCA 

GGGCAGCCTT 

ACCCTGGACC 

CTTCAGCTCT 



GCAGAGGTGT 

CTTCTTATGC 

CTATCATCAT 

ATATCTCCCT 

TGCTTCATTT 

ATTAATTTTT 

TCAAGCAATT 

CCCGGCTAAT 

TTGAACTCCT 

GCGTGAGCCA 

AATTTTGTGA 

ATTACTCACT 

CTTCTCTGAA 

TTTTAAAATA 

GAACTGAATC 

TCATTGTAAG 

CTCAGTAAGT 

TTGGAAGAGT 

GATAATTGCA 

GAACGCAGAC 

AAAGCCCTGA 

GCTTTGCCCC 

AACATAAATC 

ACAGCGAAGA 

TGGTTAAACC 

GAGAATATTG 

AAGTAAAGAT 

CCAGAAAAAT 

TAAGTGCTAC 

CTTGCTTCAT 

TTCCTTCGCC 

AGACACAGGG 

CACGAGTCAC 

CCAGAGTCAC 

AGAACACAGG 

GGAGTGCACT 

TCCTGCCTCA 

TTTTTTAAAT 

CTGACCTGAA 

CCACCATGCC 

TGTTTCTTAT 

TAACTATAAA 

CTTATGTTTG 

AGTTGTTGGG 

AAAGGAATGA 

ATTGGAGGCT 

AAGAAGCAGG 

AAAGGTAGTG 

AATATACTCT 

ACAAGTCTGT 

TCCAAGCCAC 

CCACCATTTG 

CCGGACATCT 

TCTAACAACA 



TTGTTTGTTT 

CAAAGATGGC 

AATTAGAACT 

TAGGCAGAGA 

CTCCTATTCT 

TTTTTCCTTG 

CTCCTGCCTT 

TTTTGTATTT 

GACCTCAAGT 

CCATGCCTGG 

CTTTTTATGG 

CGGGTATGGT 

TCATAATAAG 

TTTCAAGGTA 

CCCCATCAAA 

CCAGTCTTAA 

CTGGTAGCCT 

CAATGTTATT 

GCTTTCTTTC 

CTTCTGACTT 

CCCATTAAAG 

AGTCCAGGTT 

AATTATATAT 

AAACAGACCA 

AAGGAGCTTC 

ATAAACCTGG 

GTGGCAGCCA 

TCCCACAAGC 

ATAGCCTTTC 

GCCAGTGCCC 

AACATCCACT 

GAAGGCAGTA 

TTGTCTCCGA 

ACTAACTGCA 

TTATTTTATT 

GGCACTATCT 

GCCTCCGGAG 

TTTCTGTAGA 

GTGTTCCACC 

CAGCCACAAG 

AAACTGGGTG 

GTGGAAATTA 

CTTAATGCTT 

GTGATCAGGC 

GAAAAGACAA 

GCAAAGGCCC 

TGGTGAGGAC 

GTGCATTAAG 

GCTGCTTGAG 

GCACTTTCCA 

AAGGGGTTTT 

GCACAGAGCT 

TCCAAGACTC 

TGTAGTAATA 



TTAATCTATT 

ATATTCTGCT 

TCTGACCCTG 

TGGAGGCCCA 

CAGAGGTGAT 

AGGCACAGCA 

GGCCTCACGA 

TTAGTAGAGA 

GATCCACCTG 

CCATCAATTT 

TCATCATTAA 

AATTTGGTCT 

AACTGCCAAC 

ATTGGTCCTA 

GTTCAATTTT 

CCCTTCTCTC 

CCAGGACTGC 

TTGATTTTTC 

ATTCCCTACA 

GTGGGTACCC 

ACGGATGGAG 

AACCATCTGT 

CCACTAAAAT 

AACGTCTGCC 

TGCTCTTTTC 

AAATAGGGCC 

GTATTCCCGT 

AGGGGCTGCT 

TTTTTGCACA 

CTCTGCACAT 

TCAAAGTAAC 

AATCTCCTGG 

TCCTCAGAGA 

AAACAAAACA 

TTATTTTATT 

CAACTCACTG 

TAACTGCGAC 

GATGGGGTTT 

CACCTCGGCC 

TTATTTTCAA 

AGCTTAGGCA 

CCGTATTTGT 

GGTAAAATTC 

CCAACACCAG 

GTTAAGAGTG 

TAAGCTCTGG 

GTGAGGGTAA 

CGTAGCTGTG 

ATAGTAGAGG 

GAGGCTATGA 

ATGCCCTAGG 

TGGTGTTCCA 

TTTTACATTA 

ATGATATCAT 
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GGTGGCAAAG 

TGTTCCTGTA 

GTGGAGCCAT 

CGAAAAAACA 

TTCACTTTTA 

GATAGATAGA 

ATATATCTAT 

TGGCCCATGC 

CTCTCATTGG 

AC CTGAGGTG 

TATATGTGTG 

ATATGTGTGT 

GGACCAAGAG 

CACTGGTCAT 

AACTTGCCAT 

ATTGTCTTCA 

CCTTTCTTCT 

CATCTCTCTT 

GTGGCACAAT 

CCTCCAGAGT 

TAGTAGAGAT 

ATCCTCCTGC 

CTCCTCCTTT 

ATAGCTTACT 

GTAGCTGGAA 

ACTGCCCACT 

GTCTACTTAA 

CACATGCCAG 

ATAAAAATTA 

GACATACAAA 

AATTAGGTAC 

TCAGGGAGTT 

TTCAAGATCT 

CCAAGGTATC 

CACAGTAGTA 

ATGCCCAGTC 

CTGTAATCCC 

GACCAGCCTG 

AG ACGTGG C A 

TTAAGCCCAG 

GTGACAGAGT 

ACATTGCAAA 

TAGTTATGAG 

AGCAAAAATG 

CTTTTCTAAT 

CTGGTAGGAG 

ACAACAACCT 

TATAAGGATG 

ACAG CCTGAA 

TATTAGACAT 

ACAAACTTAG 

GAAAAACCTT 

GTTGCATCTG 

AATTTATTCC 



GGAGACCCTG 

GTCCCAGCAA 

GATCGCACCA 

AATATATACA 

TATATAATAT 

TACAGATATA 

AGCATATAGA 

CTGTAGTCCC 

TTGAGCCATA 

GAAGGATATA 

TATGTGTATA 

GTGTATATAT 

TCCAGGTATG 

AGTTACGGGA 

ATACGTGGAG 

TCATCATTAT 

TCTTTCTCTT 

^ZTT^TT TT!* T '1' T 

CTCAGCTCAC 

AGCTAGGACT 

AGGGTTTCAC 

CTCGGCCTCC 

AATAGACAGG 

GCAGCCTCGA 

CTACAGGCAT 

GATGACTAAG 

TTTTGAAACC 

AAGATATCAG 

TTAAAC CAC A 

AAGAATCTCA 

CATTTTAAAT 

TTGGAGGAGT 

GTAGTATCTG 

TCCCTGGAGG 

TTGTCTGCAA 

CACAAGGCAA 

AACACTTTAG 

AACAAAATAA 

GTGCTTGCCT 

GAATTTAAGG 

GAGACCCTGT 

ATGGTGAGAG 

ATCCTACAAC 

CCCCATATAG 

CTGTCACAGA 

TGTATGTTAC 

CGGCAATCCC 

CATGGACTAG 

GCTCCATCAG 

AAAAAGTAAC 

GGAAAGATAT 

GAACTTTCTC 

CTTGAAGGCA 

TATCACTTTT 



TCTCAAAAAA 

CTGGGGAGGC 

CTGTACCACT 

CACAGTAATC 

AGATTACATC 

GACATAGTAT 

GATGCTGTCT 

AGCTACTGGG 

ATCGCACTAC 

GATATAGATA 

TATATATATT 

ATATTATGAA 

GAGC CAACAT 

AAAGAAGGTC 

AGTTCTGGTG 

AATG CTATC A 

CCTTCCCCTC 

TTGAGATGGA 

TGCAACCTCT 

GCAAGTGCAC 

AATGCTGGCC 

CAATGTGCTG 

GTCTAGCTCT 

ACTCCTGGGC 

AG CACACGGG 

CTCTTTGGAC 

CTATTTATCA 

CTATAATAAG 

GTAAATCATG 

ATCTCACTAG 

CTGTAAGACT 

GAGAGCTCTT 

GTAAAATTAT 

GAACATTTAC 

CAGCAACAAC 

TGGATTAAAT 

AAGGCCGAGG 

AGAGATAGTG 

GTGGTCCCAG 

CTGCAGGGAG 

CTAAAAGAGA 

AGTGGTTTCT 

ACTCACCTAA 

AGAGGACTGT 

CTAAAGAGTG 

AACTTTGAGT 

AGTTTGGGGT 

AAAGTTATTG 

TAGGGATATG 

GAGTAACATA 

AGATCACCCT 

CTTATATCCT 

ATGTAAAATA 

GTAATAAAGC 



AAATTAAAAA 

TGAGGTGAGA 

CGGCTTGGGC 

AATATATATA 

TTATTAGATA 

CCTCTATCCA 

CAAAAAAATT 

GAGGCTGAAA 

TGCACCACTC 

TATAAATAAA 

ATGAAGACAC 

GACACTGGTG 

GCAATGTTGT 

TCCAATGAGA 

TGT AT AT AG C 

GAGCAAAGAT 

CCCCACCTCT 

GTCTTACTCT 

GCCTTCTGGG 

ACCACCACAC 

AGGCTGGTCT 

GGATTACAGG 

GTTGCCCAGG 

TCAGGAGATC 

GCTAATAAAA 

ATAAAAGACA 

AAAAACAGGA 

TTCCCATAAA 

GGTAAATCAC 

TGAAATCAGA 

GTCAAAATCA 

ATATTGCTTG 

GATATG CATC 

GGGACACAAG 

AACAAAAAAA 

AAACTTCAGG 

CGAGAGGACT 

TTTCTACAAA 

CTACTGGGGA 

CCATGATGGG 

TAAGTAAATA 

AGACTCTAGA 

CCTCTCTGTG 

GATATAAAAC 

CTCAGTATAT 

CAAGTAATAT 

ATGTTCCCAA 

TAGCAACATT 

GTTACATATA 

GAAGAGACAG 

ACCTAGAGAA 

TTATATTGTT 

AAATAAACAT 

TGGGCACAGT 



ATTAGCCAGG 

AGATCACTTT 

AACAGAGTGA 

TTATATGTAC 

TATAGTATTC 

TATTAGAGAG 

TAAACATCAG 

TGAGAGGATT 

AGCCTGGGAG 

TATGTATAGA 

TGGGAGAGAA 

GGATGGTTTC 

TGTTGACTGA 

CATACTTAAC 

CTTCTCTCAC 

GACAGCTAAA 

TTCTCTTCCT 

GTCGCTCAAG 

TTCAAGCAAT 

CTGGCTAATT 

CAAACTCCTG 

CGTAAGCCAC 

CTGGGTACAG 

CTCCTGCCCT 

TTAATTAGGT 

CAGACCTTGA 

TGAAAATGCA 

TCAATAAGGA 

AGAGGCCTGA 

AAAGCACAAA 

TAAATTATAT 

TGGGGTAGAA 

CCTCACACCA 

GAAGCATGGA 

CCCAACTACA 

CCGGAGATGG 

GCTTGAGCCC 

AAATTTTTAA 

AGCTGACGTG 

GCCATTGCAC 

ACAACTTTGC 

CTCTTTCTAT 

TCATATTTCC 

AAGAACCAAG 

GTGAGTCATT 

GGTACCATAT 

AAGAAATGAA 

GTAATAACTA 

TTTATTATAT 

TGTATATATG 

GTCAGATTGG 

TGACTGATTA 

ACATTTAAAA 

GACTAACACT 



TATGGTGGCC 

AGCTCAGGTG 

GAGCCTGTCT 

CAATCAATGC 

CTTCTCCATA 

AGGATACTAT 

CCAGATGTGG 

GCCATTGATC 

ACAGAGGGAG 

GAGAATATAA 

TACTATATAT 

ATTACCAATT 

GCTGGCAGAG 

AAAATATATG 

CAACCTAGCA 

TTTTTTTGTC 

CCTCCTCCTT 

CTGGAGTGCA 

TCTGCCTAAG 

TTTGTATTTT 

CCCTCAAGTG 

TGTACCCGGC 

TGGCGTGATC 

AGTCTCCCCA 

GATAAAATTC 

AGGAAAATGT 

AAATGCCATC 

AAAGAACCCA 

AGGGCTAATG 

TTAAGTACAC 

AAGTAAAGAC 

TTGGAACAAT 

GCATGTCACT 

TAAGAATGTT 

CACAACTTCA 

TGGTTCATGC 

AGGAGTTCAA 

AAAATTAGCC 

GGAGGATTGC 

TCCAGCCTGG 

ATTTTCTGCC 

GACTACCTTC 

TCCTCTATAA 

AAAAGTAAAG 

ATTCCTGGTG 

ATTAAGATTA 

AGCACCAGGA 

AGTTCTAAAA 

TCTTATGGAA 

TTACGTTTGT 

AGACGGGTGG 

AAATGTATTT 

ATAAAAATAA 

TGTAATCCTA 
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GCACTTTGGG 
CCAACATTGT 
GTACCTGTAA 
GCAGTGAGCT 
AAAAAAAATT 
AGTTAAAAGA 
GTGAG CCGAG 
AAAAAAAAAA 
CCTTCTGTAA 
AAAGTCTTGT 
TTGTTGTTGT 
ATCCCTGGAA 
AATGCATATA 
TCATCAATAT 
AAATAGTCTC 
GCCTTCTATG 
GTGAG CAGGG 
ATTTCTATGA 
AGTCTGTCTC 
AGAGCTGAGG 
CAGGGCCAAA 
GAAGTGGATT 
CTGCAGCCTT 
CACAAAAATT 
TGTACTAATA 
GCCAAATGAA 
GATACAAATT 
CAGTAAATGG 
TATCTAAGTC 
TGCATTGCTC 
CATGATGTGC 
GCATTTAATT 
GGGGGAAACC 
GGTTTGCACA 
AACTAGAGAA 
CTGCCAAGGG 
CAGGAATTCT 
AAGCTTGGAA 
CTTCACAGCC 
TTCATTTCTA 
AGTGTCATTC 
TTCATGGGGC 
TTACCCTAGT 
TTAGGTATTG 
GGCAGGATGT 
AGCTTTGTCC 
GAGCCCAGAT 
TGAGACTGGG 
CATGCTATAC 
ATGGCTCCAG 
GAGCAAGGGT 
TCATCACCCA 
TGCACATGGG 
GCTGTGTTCC 



AGGCAGAGAC 
GAAACCCCAT 
TCCCACGCTA 
GAGATTG CGG 
TGAAAAAAGA 
TAAGCCCATT 
ATCGCACCAC 
AAAGAAAGAA 
AAGATGAGAG 
TCTTCCTTCC 
TTGTTGTTGT 
CAAAATGGGC 
CTCTAAAGTT 
ATCTGCAGCA 
ATCTCAGTGC 
ATTCCTGCCT 
CTTATGAATT 
TTACGTTTCA 
CTGAGCTCTC 
GGTGGCCTGT 
GTCCTGCAAC 
CTTCCTTAGC 
ATCAGACCCT 
GAGATAACAT 
GATAACTAAT 
TCATGATAAA 
TGTGAAAGTA 
TTCTAACATT 
AAGTTCCTAA 
TAAGAGTTGG 
TATTATGATC 
TTGATGATAA 
AAGTCACTTG 
GATAACCAGA 
AGTATGAAAG 
ACAGGTGGTA 
TTAGTGAACA 
ACATTGCACA 
CAACTCCTAG 
TATACGCACA 
ACATTCTATA 
TTGTCATGCA 
CTTAGTCTTA 
ATGGATACCC 
TTATCAACAT 
ATGCTCCCTC 
TAGAGAACAC 
CCACTGCCAC 
ACTCAGCCTT 
ATGTTTATCT 
TCTACAAGCA 
TCAGTCACCT 
CCAAGGAGGC 
CTCAGAAGTT 



AGGCAGATCA 
CTCTACTAAA 
CCCGGGAGGC 
CACTGCAAGC 
AAAATTTTAA 
TAAGAAATAG 
TGCACTCCAG 
AGAAAGAAAG 
TAGGCAAATT 
TGTCATCTGT 
GTATTTGAGG 
TTTG CCATTC 
CAACCCCATC 
AAACATTTAT 
CGTTCAGGGT 
CTTGGTGTTC 
GCTTCTGACC 
TTATGTAAGA 
TCTGAAGAAA 
AGAAGCTGTG 
CATCAGGAAA 
CTAGCCTCCA 
AAGCAGCAGG 
ATCAGTGTTG 
ATAACCACCA 
ACTTTCCATT 
TAGTCAGCAC 
TTGGAATCTG 
AATATGTGAA 
CTGAGCTATT 
ATGTGTCAGT 
CTCAATGAAG 
GAGTAACATG 
ATGCAATGCT 
CTCTACTGAA 
AACTTGGTTA 
TACCTTGATG 
TAGTATGAAG 
AATAAGCAGA 
CCCCTCCCCT 
AAGAAACAAA 
GGGCTATTCT 
GATATTGATG 
AGATGGAATA 
TTGCATCTAT 
TGTAAGGACT 
TTATCATAAA 
TAAGATGGTG 
ACAGTATAGT 
TC CT AC AG AT 
AATCAGGGAA 
AGTGGAGTAT 
AAACAGTGGT 
TATTTTTCTA 



CCTGAGGTCA 
AATACAAAAA 
TGAGGCGCTG 
CAGCCTGGGT 
TAAACAGTGT 
TTTCACTTGA 
CCTGGGCGAC 
AAATAGTTTC 
GACTCAGTGA 
ATAGGATGAA 
GGAACACAGG 
AAATTAGTTT 
ATGGCCTAAG 
TCAAATTAAG 
TGGCCACTGT 
ACACCCTCGT 
AATAGGATAT 
CTCCATCTTG 
TAACTGGCCA 
GGCAACCTCC 
GAAATTCTGC 
GATAAGAACA 
CCCAACTAAG 
TATTAAGGTT 
AATCATTTCA 
TTCAGGGGTT 
TGATTTAAAA 
TAAATTGGTT 
TGATAGGTTA 
GATAATAAAC 
CACAGGGCTA 
TAGGAGCTGT 
G CT AATAAGT 
CAT C AC ATTC 
ATTAACTAAA 
CTG C ATAAGG 
ACTCCTTAAC 
TTCCAAGGAG 
GGCGAGAGAT 
CCTGCATTCA 
AAGAAAAGGT 
TCTTTGCTTT 
GATACTCAAA 
ATTCCTACCA 
TCTCATCCTT 
AGCTTTTGGT 
GGTCCTTAGT 
GTAGCAGGTA 
CACCAATCCT 
AAAGCTGTAG 
AAGGTTATCA 
TTCAGGAGAG 
AAATGTTATC 
ATTGACATAA 



GGGGTTTGAG 
TCAGCCAGGC 
GAACCCAGGA 
AACAG CGAGA 
TTAAGAGGGG 
CCCGGAAGGC 
AGAGCGAGAC 
ACTTGAACCA 
AATCCCAGCA 
ATACAGAGTG 
TCTATAATTC 
AGAAGTTATA 
GCAGAGCCCT 
TGGGATAAAT 
GGAAGACAGA 
AAAATTCCTT 
GGCAAAGATG 
CTGGCAGATT 
TGTTAGAAGC 
AGCCAACAGC 
CTGCTACCTC 
CAGCCTGACC 
CTGTGCCCAG 
CTAAATTATG 
GGTTAGGCCA 
TTTTTGATTT 
AATCAAGGGA 
GTAACATTTG 
TCATACTCAC 
ACTATGATCA 
AGCACTTTGT 
TAATATTTTC 
GAAAGAATAA 
ACTGAGCAGT 
CAACCTCTCT 
CCCCTTCTAT 
ATTTTCTTCA 
ACAGCCTCTG 
TTCTTCAGAG 
AACAGGACTT 
GAG CATGGGA 
ACCCGAAGAA 
CAAAGTAATT 
GCTTCTGGGA 
GCTGAAGTCT 
GATCGGATTT 
GGTGAATCTG 
TCACACAGTG 
GTTAGTTAGA 
ATTGTACCAT 
CTCATTTTGG 
AGTCAACAAC 
CCGTGGTTTC 
AGGTACCCTA 



ACCAGCCTGG 
ATAGTGGTGC 
GGCAGAGGCT 
CTCCATCTCA 
AGAAATATTT 
GGAGCTTGCA 
TCTGTCTCAA 
TATTATGATT 
AAACTTACAC 
CTTTTGGGTT 
CTTTTCTGAA 
AAGGCAAAAA 
GTAATCAAAT 
AAAGACTTTT 
CTCAAGGGTG 
GTCTTTGAGT 
ATGGGATATA 
TTCTCTAAAG 
CCATGTGCAA 
CAGAAATAAC 
AGTGAGCTTG 
AACACCTTAA 
ATTCCTGAAC 
GTAATTTGTT 
GATTTTTGTA 
TGTACTTACG 
GCAGGAAACT 
TCATCTGTGT 
CTACTTTTCT 
GATCTAATAC 
ACATGTTGAT 
ATTTTTCAGA 
GAATTTGAAA 
GAATCATACT 
GGCTGTGAGC 
CCACAGTATT 
CATCGAAGTA 
ATGTTTC C AG 
GTGCATTCCA 
ACCTGCTCAA 
ACATCGGTAT 
GTAAAGAGAG 
CCCACCAGTC 
GATTCAGCAT 
GAGGGCCAGG 
CCTTCACAGT 
TGCACAGCCC 
GTAAAGCAAT 
ACCAGAATTA 
AACAGCTCTG 
CTGCCCCACT 
CAGGGTTCTC 
ATTTGGCCAA 
TAAATTAGTG 
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AAGGCCAGCC 

TTCCTTACCA 

CAGCCTCACC 

GTGGCTCTTG 

CAGAGTCTGT 

GAAACACTGG 

CTGGAGTCAA 

ATGGATAAAC 

CAAGCAAAAT 

TCTGGAGTAG 

GACCAGAAAA 

ATGAAACTGA 

GTAAAAGCAC 

TATATATTTT 

ATGAGTCACT 

GCAGGCACTC 

CTTGTTCTTC 

TGAAACTAGG 

TATGATTTAA 

TGAACATCAC 

GGCAGGGGGA 

ACTACCCTAG 

TCTCATCTGA 

GGAGTGTGTA 

CCAAATAAAC 

ACAGTGTTTT 

TTTGTGTAGG 

TTACAATCAA 

TCAGGAGCTC 

TGAAAGAGAG 

AAATATAAAG 

CGCAATTACT 

GAGGTTAGTC 

CATTACTTGT 

ATCCCAGCAC 

CTGGCCAACA 

GCACATGCCT 

GAGGTGGACA 

AAAAATAGTA 

ATGAGC TAG A 

GCTTGGTTTC 

GGTAATGAGT 

CTCGATGCTT 

GGAATCCCAG 

AAACCTGGAA 

AATCTGGTTT 

AGCTCTAGAA 

TATTCGCATT 

TGTCACAGCT 

TTGACTCTTC 

GTGAATATTT 

CAGGAACATA 

CTGTGCTGTA 

TGCCAGGTGG 



TGATGGCACT 

TTCTCCTTTA 

TGCAGCAATT 

AGTTCATTGC 

TGTTCTTCTT 

AGATTCTTAA 

GAAAGTATGG 

AAGAATGAAA 

AAGCCAGAAA 

TTAAGTTCAT 

TGGACAGTTA 

GTTGCAGTTT 

TTAATTCTAC 

CACACAAACA 

GAAGCTGGAA 

AGATGTGGAA 

AG C CAAG AC A 

GCCAAGTTCA 

GAAGTTAGAT 

CCAGACTGGA 

GCTTTGAGTG 

GGTCATCTTA 

GGAGATGTAA 

TTCAGGGCAA 

TCTCTACTTA 

GATGACTATG 

AGACAGGTGC 

GGTCTTCCCC 

AAGAGCAACA 

AAAATTGGTA 

ATTGGTGCAA 

TTTGCACAAA 

AGCCACGGAA 

GATGTACTTA 

TTTGGGAGGC 

TGGTGAAACC 

GTAATCCCAG 

TTGCAGTGAG 

ATAATTTAAA 

GTAGTATGAA 

TGTACCTGTA 

TGAGGCCTGC 

TTTCTAGGTA 

CCAACTACAG 

CCAGGGGCAT 

AGTGAGAATA 

TGGAAAATAG 

GCTGTGGACA 

TTCTGGAGCT 

ATT AGGGAC C 

ATAGAGTCCT 

CTGATCCCCT 

GAAAATGAGA 

TGGAGCAACA 



GATGTACATC 

ATAGCACTAT 

TCTGGGCTGA 

TCTTCAGCCC 

TCTCCAGCCT 

AGTCAGACCG 

TCAAAAGGTG 

GTCTGACACA 

CAAAAGGGCA 

AGAGACAGAA 

TTGTTTAATG 

GGAGATGGGA 

TGAACTATAT 

CACACACACA 

GAATGTCCCC 

GAGGTTGCCT 

CAGGAGAAAG 

AACACTTTAT 

TTCTGAGTTG 

TTTTAAGACA 

TGACAGGCAT 

AGCATTCCTA 

AGTTGCAATT 

TTTGAATCTA 

TCTTAAAAAA 

ATATAGAAGA 

AGCTTTAACT 

ATTGCCCAAG 

TCCACAAACA 

ATCAG CTTGT 

AAGTAATTGT 

CCTAAATATT 

ATAATCTGAA 

TAAATCAAGT 

TGAGGTGGGT 

CCGTCTCTAC 

CTACTCAAGA 

CTGAGATCGC 

AATAAATAAA 

TTTCAATCTG 

AAGTTGAGAG 

AAACCAGGTT 

AATAGTCATA 

TTTAAAGATG 

AAGTACAAAT 

AATCCTCATT 

CTTGAGATCA 

GCTTCTGCTC 

TTTCCTGAAG 

TAGGGGGAAT 

CATTGTTTTT 

TGGCAACCCT 

CTAAGAAAGG 

ATTGCAATCT 



TAAAAGAAAC 

AACATACCTT 

GCCCTGACAT 

CAGACCAGCC 

CCAGAGATAA 

GATTTTTTGT 

GAAGTAAACC 

CGCTACTACA 

AATATTGTAA 

AGTAAAATAG 

GGTAGTGAGT 

ATGGTGATGG 

ACTTAAAAGT 

C AC AATC AG C 

AGTTTCCTGC 

CAGATTCCTT 

CTGGGTTAGG 

CAGTTACAAG 

GAGCGATTTT 

ACCAGGCTGG 

TTATTATGGT 

TGTGTAAGAC 

TCCATTAGTG 

TGTTCTTGGA 

ATAAAAATTA 

AGGGTCTTTG 

CTTGTATAGA 

ATCCTAGAAA 

AAGATCAGGG 

GGGATTTTAC 

GGTTTTTGCC 

TCCATAAAAG 

AGTTTGTAGT 

ATAGGCCGGG 

GAATCACGAG 

TAAAATACAA 

GGCTGAGGCA 

ACCACTACAC 

TAAATAAAGT 

GAGTGATCCT 

CCAGATGCTC 

TATTTTGACG 

CTAATTCTGC 

GAAAGATTGG 

AATGGTTTCT 

GTGCTTTTCC 

ATGAAGTCAG 

CGTACATCTG 

GAAAAATTTG 

CCCAATCTTC 

TGCTAGAGAG 

GAATAGTTGG 

GGTAAAATAA 

CATCTGCTGA 



ATTACTTTAT 

TTTTCCCTAC 

TTTTCCTCCA 

TCATAGTCCC 

GACTTCTCTT 

CTCTGAATCT 

AAATGTCCAT 

TGACAAGCCT 

GACTTTGCTT 

TGGTTACAAG 

TTCAGTTTAG 

TTGCACAACA 

GGTTAAATGC 

CACTGGGACA 

TGCAGAGTCA 

ATAGTCACCC 

AGTGCTAGAT 

GATAAAAAGA 

CTTGAAGTAA 

T AAGAGGGT C 

TAACTGAGAA 

TGACAGAAAT 

CTGTCTAAAT 

TTGCAGTCTT 

AAAAATAAAA 

ACTTAGGATG 

CGGGTTTTCA 

TGGGGGAAGT 

TAGAGGTTAG 

TGCAAGCTAG 

TTTACTTTAA 

AATGTGGCTC 

TGCAAGTGTG 

TGCAGTGGCT 

GTCAGGAGAT 

AAAATTAGCC 

GGGGAATTGC 

TCCAGCAAGA 

ATATTTCTTT 

GTTTTCTAAG 

CACTGTGGTA 

TATTTAAAGT 

TTCTTCTGAC 

TG CTAAAT AC 

TCCTTGGGTT 

TCAATCATCC 

ATTCTTACTT 

TCTTCAAGTT 

ATAAGTGAAG 

TAAGATATAT 

CATGCTAAAG 

TAGGATTTTA 

CTTG CC C AAA 

CCCAGAGCCT 



CTTCCCATGC 

TCCAAGTACA 

GTTCCAGGAT 

TCAGTCTACT 

CCTCATGTAG 

GTACCTTCTC 

CTATGGATGA 

TGAAGACATT 

ATACAAGGCA 

GTGTTGGCAA 

AAGATGAAAG 

ATGTAACAAT 

TTAAGTGTTA 

TTATTTTCTC 

TGTGTGGGAG 

AATTAATTTT 

AATTTAATTG 

GGTTTTTACT 

AAGCTTATAA 

CATAATTCTT 

ATACTGTTCT 

CAAGTGAAAC 

TAATGCAGTG 

CAAACTTGGC 

ATAAATTCAT 

AGGTGGAATT 

TATATGTTAG 

AAGAGTGTAC 

AGAGGACTCC 

TGAATTATAT 

TGGCAAAGAC 

TGATAATGTG 

TAGGTTGTTG 

CACGCCTGTA 

CAAG AC CAT C 

AGGCATGGTA 

TTGAACCCGG 

CTCCATCTCA 

CATCAGCTTC 

TGTTCACAAA 

AAAGTGCCAG 

TTGAGACCCA 

TGAAGTATCA 

TCATGGATGT 

TCATTTTTTC 

CCTATGCCTA 

TCCATTTAGT 

GCTTCAGTTT 

CCTATTCAAT 

TTGAATAATA 

GCTATATGTG 

AACTTCATTT 

GGGCTATGAC 

GAGCTATGTC 
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CACCACTAGA 

GATTTTGTAA 

AATTTTGTGT 

ATGGAAGAGG 

ATTTAGTATT 

GAAAGTGATT 

CTGTCCTCTG 

TCTATGTAAC 

ATGCATTCTA 

GGAGGGAGGA 

TAAGTAACGT 

TGGTTCCTTT 

TTATTAGCTG 

TGACATGTCC 

CAGAATAAGG 

GGAAAAAGGA 

TCCTTAATAT 

ACCTTCCACA 

CTGCCCTCTC 

ATCATAGACA 

GGAGGCTGAG 

CATGTCTCTA 

GCTACTTGGG 

AGCCGAGATT 

AAAAAAAAAA 

CCCTTCATTC 

AGTTACTTCT 

GGAC CTCCAG 

AGGTGATGGG 

TTTTCAAATG 

ACCTGTTCTA 

CCATCCCATT 

TATTTTTCAG 

AGTTAAATCA 

GGCCCACACA 

TTTGTGCTGT 

CAGGTGGAAC 

TTCAATGAGG 

ACACTGGGGA 

GTCTGTTATA 

CAGGCTCAGG 

GGCCATAGCT 

GAATTTTCCA 

AACTGACTTC 

GAAGAAAAGA 

AACTGCAACC 

GTTTAGAGGA 

TAAAATAGCA 

TATGTAGGTA 

AACTGAAGAC 

TCAGACTGCT 

TTTTTTAATG 

GGCTCACTGC 

GTAGCTGGGT 



GTCCTGCCAG 
AACAACATGC 
CTTATGAGTC 
GGAATTCTGG 
CATGTCAAGC 
TGAAGCTGAC 
TAAACCCAAC 
TGTTTTAGCA 
TATTTTAATG 
CACTGCTATT 
GCCCAAAATT 
TCTTACTACT 
CTCCATGGAG 
TTAGAAGGAC 
AGCCTGACAC 
CATCAGGACT 
GCTTTCTGGC 
ACCAAGCATT 
CTCCCTTTCT 
TGCCACAGTA 
ATGGGAGGGA 
CAAAAAATAA 
AGGCTGAGGT 
GCACCATTGT 
AAAAAAAAAA 
CCCAAATGAA 
AAGATGAACC 
GCAGACATTT 
TCTTTCTTCC 
ATATTCATGG 
AAGCAAAAAG 
GAAGGTCAGG 
AAGAATGACA 
TCTCAGCTCC 
CTCACCTTGT 
GGTGCCCGCT 
TGCTCTCCGT 
CTTCCGAGCT 
C AG C AG AATG 
CACAAGTGGC 
CAGATGGAGC 
TTTATTGAAA 
CCGTGATGAA 
CATAGGTCTT 
GGACCTAAAA 
CAAGTGCAGA 
AAAAAAGCAG 
TAAGCTTTTG 
ATAGATGAGG 
CTATACTCCT 
CTAAATTATT 
AGACGGAGTC 
ACCTCCGCCT 
CTACAGGTGT 



GAAAAAGTTG 

TGAACCAAGC 

AGGAAAAATC 

TATCTTTTGA 

TTTTAGCTTA 

CCAAATTGCT 

AAGTATACTC 

AAAGATGACA 

TCCTCAAAGC 

ATCCCCATTT 

GCCCATCTAG 

TGGTGGAAAA 

TTTAAGGAAG 

TTAGAGCCTT 

TGGAGACAAC 

ATGCCCATTC 

AAGAAATCTG 

TCCAAATCTT 

TATACGGATA 

GCTGGGCACA 

GACCAGGGGT 

AAAAATTATC 

GGGAGGATTG 

ACTCCAACCT 

AAAAGTAGAG 

AATCCCCCAA 

ACTCTCTACC 

TTGATGGTTT 

CTGTTTTCAG 

TGTGAAACAA 

AAATTCATCA 

ATGGACAGTC 

TGATGAAAGC 

TGGGGAGCAG 

AGCCCTGGCA 

CACAGCGCCA 

GTTCCTCACA 

GCTTGTTGGG 

TCTCCTGCCT 

AGTAGCTGTG 

AGGTGGCTTC 

AGCTCCAATA 

AATACACCTC 

GAAGGTTTTC 

GGAAGAAGTT 

GTTTCAAGTT 

TTTTAAAGCA 

ACTGGCTATA 

CAGCCAGTCA 

GCCTCACTTG 

TCATTATTTT 

TCACTCTGTC 

CCCGGGTTCA 

GCACCACTAC 



GATATAGAAC 

AAAACCAATA 

AGGATG CCAG 

ACAATGGTAT 

TTCTTCAAAA 

AATTGTAGTC 

ATTCATTCTC 

TTGTCCTTAC 

TTATAACCAC 

ACAGATGGAG 

TAAGTGACAA 

GTAATTCAAA 

AGCTGCCATG 

CATACAAGAC 

ATTTTCCTCA 

CTCCATGCTG 

GATGGTACAC 

TGACTCTTCT 

GTTTGAATTT 

GTGGTTCATG 

TTGAGGCCAG 

CAGGTATGGT 

CTTGAGCCCC 

GGGATACAGA 

GTACCAGAGT 

TAGGTGTTCA 

CTAAATGTCC 

GTTTTCTTTA 

GCCCTACTGC 

GAAAAAATGC 

TAACACAAAT 

TAGATAATTG 

TGTATTTCCA 

GATAAGACTT 

TACGTCTTCA 

GCAGATGAGC 

TGACATTTCT 

TCGGAGGCTA 

CAGTTGCTTT 

TCCACAGTTG 

CTCCATCATC 

TTGGCTCTAG 

ACCTGCACCT 

CTTCCAACCC 

GAGGCTGAGG 

GCCCTCATTA 

GTTTTAAAGT 

CATTGTTCTT 

GGAACAAAAT 

TCCTGATAAA 

TCTTTTCTCA 

ACCCAGGCTG 

AGCGATTCTC 

GCCCAGCTAA 



AAGGTAATCA 
CCAGTGTTTG 
CTGGTTATTA 
CATGAATCCA 
CAGTTTCTCA 
AATGCTGAAA 
GAGTGTTCTC 
TATATGCCAA 
CTCCTGTGTA 
AAACCAAGGT 
AACTCAATTT 
TGGGAATATG 
AGCTGAGTGG 
CACCTCTGCC 
AATTTAGGCA 
CCAACAG CAA 
AAAACCTCTC 
TCCTGAATCG 
TACTCCTTGA 
CCTCTAATCC 
TATAAGCAAG 
GGGGCATCCC 
AGAAGGTTGA 
GCAAGACCCT 
GATATTTTCA 
ATTTTTACGT 
CTCCCCACCA 
C TAG ACTGTA 
ATGGCTTTAC 
GGGTGTTTGG 
GGATAGAGAT 
AGCAAGAAAT 
AGTCATAATG 
GGTACTTACC 
ACAAGAGCTG 
TGCCCCTCGT 
TGATCCGTCT 
TCCATATGAA 
TGGCTTGGGT 
ATGCTTACTG 
TTCTTGGTGC 
AGATGGAGAT 
CTATGTGATG 
CTATTATCTC 
TTGTTTGGGC 
GCAAGCAGTT 
TGTTTGCCAA 
TGTATTACAA 
GCTTTTAAAC 
TTTTG CAT AC 
GTCTTCTAAC 
GAGTGCAGTG 
CTGCCTCAGC 
TTTTTGTATT 



TCATCTAAAA 
GCACACATGA 
GAAACAGTTC 
ATTTAAAATG 
TATTTCTATT 
GAATTGTCTC 
AGGAAAAGGT 
GTGCTATTCT 
TGTGTTTTAG 
GTGAAGACAT 
CAACATAAGC 
ATCATCGCAG 
TGGTCATGAT 
TCATGGAGGA 
GGACAGAGAA 
AGTCCCACCT 
CCTCTGCTTC 
TGCTTAAAAT 
TATTCCTTTT 
CAGCATTTTG 
AAAGGCAGAC 
TGTAGTCCTA 
GGCTGCAGTG 
ACCTCAGGAA 
ATGTCACTGA 
GTCCTTCAGG 
CCAAAACCAG 
GATAC CTAAA 
ATATTGTGGT 
TTTGAGAACA 
AAGAGTCCAA 
CATCATAAAC 
TTAGGTTTCA 
AAAGCTCCCG 
TGGTGTGCCC 
CTTCGCAGAA 
CTTTGAGGGC 
ATGGAGCCCG 
TTTTAAAGAA 
GGTTCGTCAT 
TGGTGGTTGA 
GAAGCAGCCA 
AGCTGGCTGC 
ATTTTGTATT 
CACGTTTGAG 
ACAAGTGGTT 
GAATTTACAT 
ATCTCGGGAA 
ATGGGGTCTT 
CTCACATAGC 

ACGCTATCTC 
CTCCCGAGTA 
TTTAGTAGAG 
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ATGGGGTTTC 

CTCCCAAAGT 

ATAAGACAAG 

CTCACTGCAG 

GCTGGGACTA 

ATGCATTGGA 

ATCATGAGTA 

CTGTTCCAGT 

AGCCTCAATA 

AGCAATGCAG 

TTCCTATTCT 

GTAATCCTAG 

CCAGCCTGAC 

GTGGTGGTGC 

GCTCCGGAGG 

AAGAGCGAGA 

GTAATTAGTA 

TGGGGAGCTT 

CTTAGAGATC 

CTAGACAAGA 

AAAAGAGGAC 

TCTCAGGCTC 

CATTCATCCT 

ATTACCTATA 

AACGTTATAC 

TACATATTTA 

TACCCCCATT 

ATTTGTATGC 

AGAAGG CAAA 

CCACCTCGAC 

GTTGTTTTTT 

AGGCAAAAGA 

TGGGACTTCC 

ATCAAGGCAA 

CTGAGGGAAT 

TTATTTCTAT 

CCAGGACAGG 

GACCCAAGGA 

G AGTGAC CAG 

CCCCGATCCC 

GGGGCAGAGT 

AATTACCGGG 

TTTCTTCTGA 

TCTCTGCTGG 

GGTTCCTGTC 

CTGGTGAAAG 

TTTCAGAGTT 

TACACACTCA 

CTCAGACACT 

AGTTAACTCA 

GATTTAGGAG 

GAGGGGGAAA 

TAATAACCCT 

CAGAGGTTCA 



ACCATGTTGG 

GCCAGGATTA 

TTCTCGCTCT 

CCTCGACCTC 

CAGGTATGTG 

ATTTAGAGGA 

GGAATTTAAA 

TGCCACATTT 

TACAACATTC 

GCTTCCTCAC 

AGTTAACTTC 

CACTTTGGGA 

CAACATGGCA 

ACACCTGTAA 

GGGAGGTTGC 

ATCTGTCTTA 

GTAACACTTA 

CACCCCAATA 

AGCAGATGCT 

AGAACAATTG 

TAAGAATGTA 

ATTCATTTTC 

CCCAAATATT 

TTCTCCTGAT 

ATACATATTT 

TATTTATGTA 

GGCAGAGGGG 

CTTTTCTCCA 

GGGGAAGTGT 

CCCCCCCATC 

GTTTGTGTTT 

AAGAGAAAGG 

CTGGCTAATA 

C C AG AACAAC 

AAGAATTGGA 

GGGATCAGAG 

ACCTATCTCA 

GGTAGGGAAG 

GAGCGAAAAA 

TCCCCCCCGC 

CAGGAGGAAG 

TAGGCTGTTT 

AGACGTGTAT 

GGTCACTGCT 

CTGGGCCCCA 

AAAGAGCAGG 

GGAGGGGCCC 

CATATCCACT 

TACATATTCG 

GACCGGATTA 

AAAACTAGTG 

CGTAGGAAGA 

GTAAACTATC 

GTTCACAGAC 



TTGGCTCGAT 

CAGGCATGAG 

CTTGCCCAGG 

CTGGGTTTAA 

CCACCATGTC 

TACACAAACA 

TTTAACTTAA 

TGATTGCTTA 

TGTTATCACA 

AAAAGCTGTA 

AAGAATAATT 

AGCCGAGGCG 

AAACCTCATC 

TCCCAGCTGC 

AGTGAGCCCA 

AAAAAAAAAA 

TGCAATTGGG 

TATGACTCCC 

GGAAGAGACT 

TTTTTCCTTC 

ACCAGACCTA 

CAAAGAGAAC 

CATTTATTCT 

ATCACCCTTC 

ATACAGTATA 

TTTATACATA 

GTAATCACTC 

ATTAGCCTGC 

TCCCTTGGCT 

CCCCACAAAG 

TTGTTGTTGT 

AGAATAGTGA 

ACGTCTTGCT 

CAGAAGAACC 

AAGAAGGCTG 

CTCCTGCAGA 

AGAGACATGT 

GCAGAAAGAA 

GCCTGCCTCT 

CCGCCCCCAC 

TTTGAAGAGT 

TCCTCTCACA 

TCCTTGGCAG 

CTTCTGGGGA 

CTCATCTAAG 

AAAGAGGTGA 

TGCTGTCACG 

GAGAAAACCT 

CTGCTAGTCC 

AACTGAGAAG 

ACGTTGTTCA 

AAATATCCTT 

ATGTGACCCC 

TCTGATTTGA 



CTCTTGACCT 

CCACCGTGCC 

CTGTAGTGGA 

GCAATCCTCC 

CAGCTAAAGT 

TCTAGCTGTA 

TAAAAATTAA 

ATAGTTGCAT 

GAAAGTTACC 

AAAGAGAGAA 

GTTACCAGGC 

GGCAGATCAC 

TCTACTAAAA 

TCAGGAGGCT 

GATTACACCA 

AAAGAATAAT 

TGATCTGTGA 

TGGTATAATG 

TTTCCCCTAT 

CAACCCCTAT 

ATCAGACACT 

CATTTACAAG 

CCCTAGTAAT 

CCCTCTGAAA 

CATACATATT 

AGTATTTATA 

TGTGATTCTA 

CTTTTGTGAG 

CCTACACCAT 

AACAACAACC 

TGTTTTTGCT 

ATACCTCTTC 

AGAGACC CAA 

GGTTTATCCT 

CAGAGCAGAG 

ACTGGGGAGT 

TCAGAGTGAT 

GATGGGGGAG 

TCTGAGAACC 

ACCCCTACTC 

GCCTAGAATA 

ATTTGATCAG 

GCTATTTCCT 

GATGGGGCTC 

TTCTGAATCT 

GAGCTGTAAA 

AAATATATTC 

TAGCCTGGAC 

CCTCTGTTGC 

TGAAACTACT 

TATCATTTGC 

CTTTTACAGC 

AACACAGAGT 

GATCTTTCTA 



TGTGATCCAC 

CAGCCTCTTT 

GGGCAGTGGC 

TGCCTCACCC 

CTTCTCTCCA 

TAGCTAATAC 

AATGAAAAAA 

GTGACTAGTG 

TTGGACCAAG 

CTCAGGGAGT 

CAGCACGGTG 

CTGAGGTCAG 

ATACAAAAAG 

GAGGAAGGAG 

CTGCACTCCA 

TGGTACCAGA 

CAGATTCCAT 

AGTATTTTGA 

CTACATAAAG 

TATCTCATTT 

TTCACAAAAT 

TTAAACTCTG 

CATTTACTGC 

TAAATATGTA 

TATACATACA 

AATAAGGCTA 

GCCCATGTAC 

TCGATTTTTC 

CATGACAATA 

AACACTGGTT 

TTCAGGAGCA 

TGCAGAGAGG 

CCAGGAGGAT 

TTTTGTGCCC 

GGTTTGCTCC 

TTACTTTTAC 

TGCAACATAA 

GCCAGGGATA 

TAGCTGGGCT 

CTGGGAGCTC 

AAAAACAGTA 

TCTCTTGAAG 

CCAGTGATAC 

CCCTCCTTCC 

TCTGAGATTT 

ACAAAGAAAG 

CCCACCCCAC 

CTTTTCCGTA 

TGCCACTTCC 

GTGGGAGGCG 

ACTCCGCCTC 

AATAAAAAGA 

ATCTAAAAAC 

CTTTTGCCAC 



CCGCCTCAGC 

TTCTTTTCTT 

ATGACCACAG 

TGGCAGAGTG 

GAAAGAAGAA 

AGTAGCCACT 

TTCAGTTTTT 

GCTACATAAC 

TGCTGGGAGA 

GTGAAACTCT 

GCTCACGCCT 

GAGTTTGAGA 

TTAG CTAGAT 

AATGACTTGA 

GCCTGGGTGA 

ATTACTCTTT 

TGAAGGAGTA 

ATTAAAGGCC 

ACCAGTCACA 

TGTACTGAAG 

AATGTCTGTC 

TTCCTCCATT 

CCCTCAAAGA 

TACATGTATA 

TACATATGCA 

TATAAGTATC 

TTGTTAATAA 

AGTGAACTTC 

AAATTTGACT 

AATAAGGTCG 

GAGGTATAAT 

GGTGCCTAAG 

AATGGAAGCA 

TCTCCCTAAA 

TGAGGAGCAG 

TATCTCTTCT 

AGAGTTTGCA 

GGCAACAGAG 

CTCCCTGTAC 

CTCTAGGACA 

ATTTAACTAC 

C C AC AC AGAA 

ACCAGGCCCC 

AAGGCTCCAG 

GGTGTAAAGT 

TC CTGAC CAT 

TTG C CATC AG 

ACCTTCACTG 

TGGGTCAGGA 

GGGCTCATAA 

TCCGGTAAAG 

AGGAACCAAT 

AGGAAGCCTG 

CAACTCCCTT 
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GGGAGTCCTT 
GTTCTATAAC 
TTGGCATCAC 
TGTTTTTCTC 
CAGGATAGAA 
GTGGGATCCC 
AGGGTCAAGT 
CTATCACCAC 
ATAAGAGCTT 
TAAATGCTGG 
CAGGTGGGAG 
CCTGCCTGCT 
GGTCCAACTA 
GCAGAAGACT 
CACCTACGAT 
TTCTACTCCT 
GGTCCCACGA 
ACCCAGGCAT 
TCCATTTTGA 
AAGTTAGGCA 
AGATCTTGCT 
ATGGCAACAA 
TTAGCAAGTT 
TCAGGAAGCT 
ACATCTTTCC 
AACTGTAAGT 
AGTAATTTAT 
CTCACAACCA 
CAAAGAGCTT 
GACTATCATG 
TACCTCCCAC 
CACAGCCAAA 
CGTTCTTTTA 
TTTCTCACAC 
AACACTATCA 
ATTACCAATA 
ACCTGTTTCC 
G C AG AGATGG 
AATCAACCTG 
GATCCTGTGA 
GTCTCCTTCA 
ATTCCTAGAA 
AC CACTTGTT 
CCCATGGCAC 
ATGAGAATGC 
GAATATTTCT 
AACACACATC 
GTTGTGTGGG 
TAGTGTGATG 
GATGTTAATA 
CAGAGAGCTG 
AAAAGAAGGC 
GAAAAGAGAG 
AAAAATCAAC 



AAGCCTTCCT 
TGCTCTGAAG 
AAAATTCATC 
CTTGCTGTCC 
AATGGGAGTC 
TGGGACCATC 
CCTTAATGAC 
CCGTTAGAAT 
GTCACATTTA 
CTTAAGTAGA 
GAGAAATTAA 
GACCCTGATA 
CTCAAAATAA 
CTTCAGCACT 
TCATCATGCC 
GAGGCAACCA 
TGAGTCCTTG 
GGCTCATTCC 
ATGGGGGCTA 
TTGTAAGTCA 
GATAAAACAG 
GAGTGGCCTC 
AGAAGGCATT 
ACCCTATATA 
CAGAAAACAT 
ATCTGTATTA 
AC CAGGAAAA 
CAGCAGAAGG 
GTGCAGGGAA 
AGAACAGCAG 
CAGGTCCCTC 
CCATATCAGT 
TTCCTTTACT 
GAGATCTAAG 
AAGGATCAGG 
ATAACAGCAA 
CATTGCAGGA 
GGTGCAGATG 
AACTATCCCC 
TTATGGGATA 
GAAGCTCTGT 
TGCCTTTCCT 
AGCAAGTGTA 
TTATCACTGA 
AAAGGGATGG 
AAGCATAGTG 
ACAAGAGGTG 
GTATGTGGTA 
CATGTGATGT 
TGTCAAATGG 
CTTCTCTCCC 
TGTGACCTAC 
TGGAGGGGCC 
TCACAAATTT 



AGCTGATGTT 
GGTGTGGTGG 
TGAGTCAGCT 
TTCTGCAGGA 
ACTAGTGGCC 
ACTCTGCTTT 
TTAGCTCCAG 
TATTATTTCA 
TAAGTCTCAG 
TGCAGTGGTC 
ACTTGAATTC 
GCCAATGGAA 
AGGCAAGATT 
GCACCCTCCT 
CTGGCATGAT 
GCACACAGAG 
CAGATATCTA 
AGATCTGTCC 
GGTAAAATAA 
CAGGATGAAA 
GTTGCAGTAA 
TAGTCATTCT 
CCCACCAGCT 
GTCTAAAAAG 
ATGAATAATC 
GTCCATTTTC 
AATGTTTCAT 
CAAGGAGGAG 
ATTCCTTTCT 
TATAAATTAC 
CCACAATATG 
ATCCTTAGTC 
TTCTTGCTTT 
AACCTTCTCT 
AAAAGGAAGC 
GACAAAAGCA 
AGGAGGGGCT 
AGACTTCAGG 
AAGGAGGAAT 
TAGGAGTCCA 
ACTGTGTGTT 
CAACTCCTTC 
CCCCTCTCTC 
ATATTTTACT 
ATTTTTTTCA 
ATGTGCATTA 
TGTGCACATA 
CTGTGTGTGC 
GGTATGTGTG 
TACTAGAAAG 
ACCTGTAGGA 
TGGGCTGAGG 
AAGGGAAATT 
ATTAACATGT 



ACTTCTTTTG 
AAAAAGGGGT 
TTCTATTCTT 
CTCAGATCTT 
CAGCAGTGAG 
GTGCTTTGTG 
CTTCTCCACT 
TGGGGAAAAA 
GTGTAAGAGG 
CAAGGGAACC 
TGGGAGCCAC 
CATGGAGTTT 
GGGAAACACG 
GGGTGCTCAC 
GGTTGCAGAC 
AGAGGAGAAA 
CAACTTTCAT 
TATTGTCAGA 
GGCTGAGACC 
TAGGCAGTTG 
AGAAGCTGAC 
CATTGCTCAT 
CCATAGTGGT 
GGGAGGAACG 
CACTCCTTGT 
ACACTGCTGA 
GCTCTTACAG 
CAAGTCAGGT 
ATAAAACCAT 
TCAGGGAAAG 
TGGGAATTTA 
CAGAAGCTGA 
CACTTTACTG 
TAGGGTCTGG 
TAGTGAATGC 
AAACGGATTG 
GGTTCATGCA 
AATATGTTGA 
GCATTATCTC 
AAGACTCACA 
CCCACTGTGG 
AGATTTTGCC 
CCTCCCAAAC 
AATTTATTTT 
ATATTGTTCA 
AATCAAAGAG 
TGTGCATGAT 
TGTGTGTGGT 
TGCGTGTCCA 
TATCAGAACT 
TATACTGATG 
AAATAAAAAC 
TCCCCTTTGG 
ACACAGGGAG 



CTATTTATGG 
GGTAACAGCA 
CTCTGTCCCG 
CTTCAATAGC 
TGCCCCCAGC 
GAGAAAAGGC 
TCAAAATGAA 
AGATGGATTA 
CATTTATGAT 
AGTAAGGGGA 
TGGCCTGTCT 
GGCCCAGCTG 
TTCCTTTCTT 
AGAGCCTTCT 
CCCATGC ATA 
GAATGAGCCC 
TGTTGTGGAT 
GGTGTTCAAA 
TACTGGGCTG 
GCACAAGACA 
CAAAACCCAC 
TATACACGAA 
TTATAAATAC 
CTTGGTTCTG 
TTAGTACATA 
TCCAGACATA 
TCCCACGTGT 
CTTACATGGA 
CAGGTCTCAT 
ACCTGCCCCC 
AGATGAGAGT 
TGCTCTGCCT 
TGTAGACTTG 
GTTGGGACCC 
TAAAAAGGAA 
TGACAGCTGT 
CAGAGTGGCC 
CAAAGGCAGG 
TAATATGTAA 
ATGGGAAGTA 
GCAAGAGTCA 
TCTCAACTAA 
ATTTTCAAAT 
GTTTAGTGTT 
CTGATGAATC 
TAACTTTCTG 
GCACGTAGTG 
ATGTGATACA 
TACATATTAG 
CATGGTGCTT 
GTTTGGACAG 
GAAAGTAAAA 
CTTCTGGGGA 
AACCATAGAA 



GTTGCTTGTG 
GTAGGACTCA 
TTCTGTGTCT 
GAGGGTCAGC 
TTAGAGCTGT 
TGTGGGGTCC 
AGGAAAAGTA 
CTATCTCACA 
AACAACATAA 
GCTCAGGACA 
GGGCCCCTGG 
CAATCCCTCT 
CCTATACCAA 
GTTGTTTTGC 
GCATGGGACA 
CTGAATCCTT 
GTGACTCTGT 
C C AGAATG AC 
CATTCCCAGG 
CAGGTCATAA 
CAAAATCAAG 
TTATAATGTG 
CATGGCGATG 
GGAATTGCCC 
ATCAAGAAAT 
CCTGAGACTG 
CTGGGGAGAC 
TGGCAGCAGG 
GAAACTTATT 
ATGATTCAAT 
TAGGTGGGGA 
GTAGAGTAGC 
CCCCAAATTC 
CCTTTCTGGT 
ACAAACTACC 
CCCATCTCAC 
AATATTAGAA 
CCTAGGGAGA 
AGTTAGGCTT 
GGTCACTAGA 
GCACTCAGCT 
CCCTATCCTG 
CTATTTTGTT 
TGCTTCCCTC 
CCAGTAACTA 
AATTGCACTA 
TGGTGTGGGT 
TAGTTTGTGT 
GGGTGGCGGG 
ACTGGTTTCC 
AGAAGAAATA 
GAAGAGCTGG 
AACTTTGCTG 
TGATTATCCA 
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CTTCCCAAGA 

AGAAGAGAAA 

GTCCGGGTTT 

GAGTAGAATT 

TTTCTGGTTT 

CACCCTGTCC 

CCTGAACTGT 

CCCTGTGCAA 

AGAGGTTCTC 

CATTACTAGG 

AAGTCCGTTC 

CTCTTTTTGG 

TGGTAGGGAT 

TGACCAAGGG 

TGTG AT AG C A 

CTGTCACCCT 

TCAAATGATC 

ACCCAGCCTG 

GTAAGCACTA 

AC C AAAA AAA 

TGAGGTGGGG 

ATGCTGTCTC 

GACAAAGGAT 

TGTGTTGCCC 

AGGCTCAAGC 

CACCACGCCT 

GGCAGGTCTT 

GGATTACAGG 

TACCCGTATC 

TGAGTATTTT 

GTACATAAAA 

ATCTCATTGT 

TAAGATAATC 

TGTTCTTTGA 

CTTCTTCTGT 

CAAGGTGGCA 

AC C AC AG AGT 

ATTTTTCAGC 

CCAACCATCT 

GGCTCAGGCC 

ACATCAACAC 

TGCTAGCTAA 

CTAGTGCTCC 

CCACCCCACT 

TTCTTAAACG 

TCCCAACCAC 

TCACTCAAAT 

ATGACACAGA 

ATAATTTTCC 

ATGCATTGAA 

TGCCAGTACT 

TAACTGAAAA 

CTGGGGATGG 

CCAGACCCCA 



GGGCTTAAAA 

CTCTGTTGAT 

TTGTCTCACA 

TATTAAGTGA 

GCTCTTCACA 

TACCAGTGCA 

GCATGTGTTA 

GTTCCCTTAT 

TGGGTACCAT 

GAGAGTAAGT 

GGGCATGGTT 

TGGGTCTGGA 

AGTTGGTGGC 

CCATATATTA 

TCACAGTGTG 

GGCTGAAGTA 

CTCCCATCTA 

AAACTGCACC 

CTCAGAGGTA 

ACCAGACCGG 

GGAGTCACTG 

TATTAAAAAA 

GTCCCAAAGT 

AGGCTGAAGT 

AATTCTCCCT 

GGCTAATTTT 

GAACTCCTGA 

CGTGAGTCAC 

TCTAAAAGGG 

GAATGAAAAA 

ATAGGATGGC 

GCATTATAGG 

AGTCTCTAAG 

TCCAATTAGT 

TTCCCATAAC 

TATAAGCTTC 

AATTAAATTG 

AAAACTTCCA 

GAATAGACTT 

ATTAAGGGAA 

AG CTTTT AAG 

AAACTTCATC 

TTTCTATTAA 

CCTCCGCTGC 

TATTTGATTG 

CTTGAGCGCT 

TTGGCTCAGA 

TGACTGCTTC 

CCGGATAGCC 

CACTTGTTTT 

CCTGATTTGT 

ACAGAAAATA 

TGGCTCACTC 

AGAGAGGGTT 



G CTTATAT AT 
GGGATTACTG 
GCCAGGAAGA 
AAGGAAAGCT 
GTTGAATACT 
TGCAGGCCTT 
AGGAAAGGAA 
CTGCACAAAA 
TCCCTTACTG 
AGATCAGGGA 
ACATTCTTGG 
TCTTAGGTAG 
AGGGATGTCA 
GGGTATCAAT 
AAAGCAATTT 
CAATGGTACG 
AGCATTTCAA 
CACTTTCTGA 
GGAAGAAAGG 
TGTGGTGGCT 
GAGGCCAGGA 
AAAAAATACC 
AAGTCTTAGC 
TCAGTGGCGT 
GCCTTCAGCC 
TGTTTTTTTT 
CCTCAAGTGA 
TGCACCCGGC 
AGTAGTGAAT 
CTC TTAGAGA 
CCCACCAGCG 
AAAGACCAAG 
CATCATTTAA 
CTCTCCTGGT 
CTATTTTGCA 
TAAATTCCAC 
TAAAG CCTTT 
AGGGCAAAGG 
TCTCTTCAGT 
GTGGGGGTTG 
TCTGATAAGA 
CCATAGTACA 
TCCCAAATCT 
TTCCAGTTGT 
ATGTCCCATG 
TGTTCTCAGG 
ATAAATCTCT 
ACTGAAGCCT 
CCAGAAGCAG 
GTGCCAGACC 
TAATACATTC 
AATAAAAATA 
ACACCTGTAA 
CTTGGATCTC 



TATCCTGGCA 
TTGCGGATTT 
ATTAGGCATG 
CTCAGCAAAG 
AGGGCTTAAG 
TAGACTG AG C 
TCATCCACTG 
CATCCGGTGT 
TCTGCCTAAA 
ACAGAGATTA 
TCTTACAGGA 
ATAAAGAAAC 
GAGAGACTTT 
TTCTGAGCCC 
TTTGTTTGTT 
ATCACAGCTC 
AGTGTTGGGA 
TAAACTTTTC 
ACACAGGATT 
CACACCTGTA 
GTTCGAGACG 
TGCCTTGAGC 
ATTTTTTTTT 
GATCTCGGCT 
TCCCAAGTAG 
AATAGAGATG 
TCTGCCCACC 
AAAGTCTTAG 
TTCACCCCAA 
TCAACAGACA 
AGAACAATTG 
AATGTAACCA 
ATTCCAAGGA 
AGTTACATAT 
AGGATCAAGC 
TGGGATATTG 
TATCTTATGA 
TATAAAACAA 
CAGGCTTCTT 
AACATGCCTC 
AACATTTTAC 
ACTTTGG TCT 
TTATACAAAC 
CCCGCCTCTC 
CCTCCCTAAA 
ACCTCCTGAG 
TCAAATGTTT 
GCTCTGGAAG 
CTAGTAATAA 
TATGTCAACA 
TAAATAAAAA 
TATAATAACT 
TCCTGTTACC 
ACACAAGAAA 



AAACAGATTA 
TTGCTCCTTC 
CAGCCATCAA 
ACAAGGGTCC 
ACTCAAATTC 
TACTCCATAT 
CAGGCATGTT 
AAGCACTTGT 
GCAAGCTGGC 
ACTTGAACAT 
AGGGTAAATA 
TTTAATTCCA 
GAGGCTTCTT 
CAACAAGAGC 
TTTAGAGACA 
ACTGTAATCT 
TTACAGGCAT 
AAATGACTAA 
ATAGGATTAA 
ATCACAGCAC 
AGCCTGGCCA 
TAATCAGAAT 
TTTTTTTGAG 
CACTGCAACA 
CTGGGATTAC 
GGGTTTTGCC 
TTGGCCCCTC 
CATTCTTTAC 
AATGTGGCTT 
CTAAAGAGAC 
TTCTTTTCTC 
CACCTGAACA 
GAACTATTTA 
TGCCCCTCAA 
CCCTGTTATT 
GTACTATGTG 
ATCTGCCTTT 
AAATAAAATT 
AAAATGTAAC 
ATTATTCCTC 
AACCTATTCT 
TCACAACCTG 
TCAACCAATT 
TGGACCAAAC 
ATGTATAAAG 
GGCTGTGTCA 
TACAGAGTTT 
TGAGTGGGGG 
TACACTTAAA 
TTTGCTTTGT 
TTCTGGAGTT 
GAAATAAAAA 
GGAAAGGGGT 
GAATTCGGGC 



TGGGAGGGGA 
GCTCAGCTAG 
AGAATGAGTG 
TGAAAGCAGA 
CTGACAACTC 
TGATTAATTT 
TAGGCAAGCC 
GGGGCAGGTC 
CAACTCCTTT 
TATCTTGTGA 
AAAATAATTG 
CGATGTGTTT 
CAGTTCAATA 
TTAGGAGAGA 
GGCTCTTGCA 
TGAACTGGGT 
GAGCCACGGT 
AGGGGAGAGA 
AACAACAACC 
TTGGGGAGGC 
ACATAGCAAG 
CATGGACCCT 
ACAGTCTCGC 
GCTGCCTCCC 
AGATGCCCAC 
ATGTTAACCA 
CATAGTGCTG 
AAACAGTTTG 
CCTGATATAA 
TTTTCCCTAG 
CCTCTCTGTT 
GACCCTTTTA 
CAAATTTATC 
CAGAATTCCT 
TCTTCAACTT 
CATGAGGAGA 
TTTTGTGTTC 
CTAAAGCCCC 
CTGAAAGACT 
TCTGGCATTA 
CTCTGAAGCC 
TTATCACAAC 
GTCATCACCT 
CAGTGTACAT 
CCAAGGTGCA 
TGGGCCATGG 
GGCTCTTGTC 
TTTTG C AAGG 
GGT AG CTAAA 
GCCAGGCTTA 
TCAAATATAA 
TTTACTAAGG 
CCGTCCAGAT 
GAGTCTGTAA 
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AGTGAAAGCA 

GCAGCTCTGA 

AAGGGGTGGA 

CCATGGCATT 

GTCACTCTCA 

TCATCGCCAT 

TTTGCCCAGG 

TTCAAGCGAT 

CACCCAGCTA 

ATCTCCAACT 

TAGGTGTGAG 

AAGGTCTTTA 

CTAACTTACA 

AGATGGAGTC 

AGGAGGATTG 

CTCTAAAAAA 

GCTACTCAGG 

CAGAGGCATT 

CACCTGCTAG 

GTCAGCACAA 

TGGCCAAAAC 

TCATTATATG 

CAGCTTACAA 

AACCCTCAAT 

TTGTTTAGCA 

TGTTCTGCCT 

ACTTTACTGT 

GGGGTCTGGA 

CTGAGGAGAC 

AGTGGTGGAG 

GGGTCTCTCC 

ACCGAACTTG 

AGAGGCCCCT 

AACTGCTATT 

GGATTAGGCA 

GCTTGACAGC 

ACTTTTGATT 

CCTTCCCCAT 

TTTTCTGTTA 

AAGTTTGATT 

ATTGGGTGCT 

GAAATGGAAA 

GAGAGACATT 

CGTAGCTTGG 

GAACCCAGAA 

GCCTCTCTCT 

AAGTTTTGAT 

TATACTTTGT 

GGATAGAACA 

TCCGTTGCAA 

TTCCTTCTCA 

TTCTCTTGGT 

TAAAAATTAT 

GGGCAATGGA 



AGTTTATTAA 
GGGCTGCTGG 
TAATTCATGC 
CGTAAACTGT 
TCGCCATCTT 
CTTGGATTTG 
CTGGAGTGCA 
TCTCGTGCCT 
ATTTTTTATA 
CCTGCGCTCA 
CCACCCCACC 
TGACCTGTAT 
GGGAATGCAG 
TTTCTTGTTC 
CTTTAGCCTA 
AAAAATACAA 
AGGCTGAAGT 
TGAACCAGAA 
GCTGCATTTC 
GGTACACATC 
CCATCAAAAC 
TTAATTATAA 
ATACTGCGGC 
TTTGGGAATT 
CATAATCCAG 
AC AG AGT AG C 
ATGGACTTGC 
TCAAGACCCC 
TCTGAAGCCA 
TCCCCGGGTA 
TAAGACAGAG 
GGTTTGAGAC 
CTCAGTAAAG 
CTGTTTGTAT 
TGTACAGGAT 
TGATAGGACT 
CAGTGTTGCT 
CTCACCACAG 
CTTGAGACAA 
AAAGATGAAA 
AAGTGGAGTG 
ACGTTAATTT 
TATTTGCCTG 
ACCCAAGGGC 
ACCTGGCATG 
CTTAGTGAAA 
TAATGGGAAC 
GATATCAATT 
TGGGCTCAGG 
AGTTTATTAC 
TCTTGTTTTA 
CTCTGCCATC 
CTCAAGCCAT 
AACTAAC C AG 



GAAAGTAGAG 
TCGCTCATTT 
CTCCATTTTT 
CGTGGCGCTG 
GGATTTGGTG 
GTGGGGTTTA 
GTGGCACGAT 
CAGCCTCCCA 
TTTTTAATAG 
AGCCATCCAG 
TGGCCTAGCC 
TTTGTGCCCA 
CCCAGCAGGA 
AAATACCTCT 
GGAGCTCAAG 
AAAAATTAG C 
GGGAAGATGG 
TGACTCTATC 
CAGTATGTTA 
ACAAAGACCT 
CAACATGGCC 
TGTATTAACA 
AATATCTGGA 
GTCCACCCCT 
AAATAACTAT 
CATTCTTTTA 
CCTAAATTCT 
TTTCTGGTAA 
AAGGAAACAG 
AAGGATAGGA 
AGGGTTTCAG 
CCAACTTAGG 
TCTCTCTTGG 
TAATCTTCCC 
CACGGGACAT 
GTTGGAAAAG 
GCAATGGGTG 
GCAATGCTTT 
CCATCTTGCC 
GGGCCTATCT 
GCCAATGTCT 
GGTTACTTTA 
TGGTTCCATG 
TTTGCAGTGA 
CCAGCAAAAG 
ACTGAATGAA 
AAGGATTTGT 
TGTCTTTCTG 
ACTCCATAAG 
AGGTCCCTGG 
TGTCCTTTGG 
CAGGGAACAG 
TGCAAGCTCA 
TGTTGTAGCT 



GAATAAAAGA 
TTATGGTTAT 
TAGACCATAT 
GTATG AG CAT 
GGGAGCAGTG 
GCCAGCTTCT 
CTCAGCTCAC 
AGTAGCTGGG 
AGACCGGGTT 
CCACCTTAGC 
GGCTTCTTTA 
CTGCCTGCCT 
CTCAGCCTTA 
GACAAG C CCA 
ACCAGCCTGG 
CAGGCATGAT 
CTTCAGCCCA 
TTGAATAGGC 
GGCATTCTTA 
TGC TG AT AAA 
ACCAAAGGGA 
TGCTAAAAGA 
CTTTACCTTA 
TTTTTGGAAT 
AAGTATGCTT 
TTTCCTTACT 
TTCTTGTGTG 
CATCTTTCTG 
ACTACAGCAC 
TTGGGTTAGA 
TCCGCTCTTA 
AAGGCTACAG 
TTAAAAACGG 
TGTGCTCTTT 
TGGGAACTTT 
ATCCCTTTGC 
GGTCTTTCTC 
TCTCCCTTTC 
CAGAGACCAT 
GGGGGCAAGT 
ATGTTTTGTC 
TGTGGCCATT 
AAACAGAAAA 
GCAAGGTTGC 
GGTAAAGATT 
TGGTAAAAAT 
GGGGCTAGTC 
TATTACTCTG 
CCTGCTGTTC 
AAAAAAAAAA 
AGCTTCACCT 
GAATTTTGGG 
AAATTGGCTG 
CAGCAGCTAA 



ACGGCTACTC 
TTCTTGATTA 
AAAGTAACTT 
AGCAGTGAGG 
AGGATGACCA 
TTACTTTTTT 
TGAAACCTCC 
ATTACAGGCA 
TCGCCATGTT 
CTCCCAAAGT 
CTGCAACCTG 
CATCCTGTGG 
TTTCACCCAG 
ACACTTTGGG 
GCAACACAGT 
GGTGTGTGCC 
GGAATTCAAG 
GCTGGATAAA 
GTCACAGGAT 
ATAGGTTGTG 
CCTCTGGTTG 
CACTCCTACC 
TATGGTCTAA 
GCTCATGAAT 
ATTTGAGCAG 
TTCTTAATAA 
AGATCCAAGA 
GTGACCACGA 
CAACTGGCTG 
GGTGCAACTT 
ATAAAGGG C A 
TCCTTAAGAT 
ATTTAGCATT 
GCTGACAGCT 
TCTTCTCTCC 
TATGACAAGC 
TGGCCTCTGT 
TCTCTTTTCT 
ATGTTGAAAC 
TTGAGCCTTC 
ACATGTATAT 
GGGCAGCATC 
AAGTTGGTTT 
TAGTGCTGCT 
TCTTACCAGT 
CACTGTTTAT 
TTAAGCTGTA 
TCATAAAGAG 
AAGCCAGCCC 
AAATAAAAAC 
TGTAAC C ACG 
GTTTATGTAA 
CTCTGGACCC 
GGATTTGTCA 



CATAGGCAGA 
TGTG CTAAAC 
CCTGACGTTG 
ACGACCAGAG 
GAGGTCACTC 
CTTTTTTTTT 
AATTTCTGAG 
TGTGCCACCA 
GCCTACGCTG 
GCTGGGCTTA 
TTTTATC AG C 
CTTACAATGC 
CTCCTATTCA 
AGGATGACAC 
GAGACCCCAT 
TGTAGTCCCT 
GCTGCATTGT 
ATAAGGCTGA 
GAGATAGGAA 
GTAAAGAAGT 
TCTTCACTGC 
AGCATCATGA 
AAGGTGGAGG 
AATCCACCCC 
ACCACGCTGC 
ACCTGCTTTC 
ACCCTCTCTT 
AGGGACAATA 
ACTTTGGGTA 
AGGGGAGATA 
AGAATG CTTG 
TTAAGGGGTT 
AGGGGATGTT 
ATGGGTG AC A 
AAAAGGGGAA 
AGCCGCCTGA 
GAACTCCTCA 
CTTTTCTGTC 
TCCTGGTCAG 
CCAGTTAGAT 
TGCTCTGGCT 
TTACAAAAGT 
TCTTTTGTGT 
CAGTGAAAGA 
CAGGCTTCTG 
CACCTCTGTA 
ATGAATCTGG 
GAATATGGTA 
AGTAAACTGG 
TGGATGAAGT 
TGGCGGTACT 
TAGTTAACTC 
CTTCTGGGAA 
TTTTATAATG 
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GCGGCCAAGG 

CTTTACCATT 

TCTGAGAACC 

GGATGGCTAA 

TGCTATGGTT 

GACTCCTTGG 

CCTCATGAAA 

TGGCTTTGCA 

ATTATGAGGG 

AAGGCAAATA 

TGACTACCTA 

AGATGGCTGA 

AAAATGGGAC 

CTCAGAGTTA 

GACACAAGAG 

TTATAGAAGA 

GGGAGTGGGA 

TGGGTAAAAT 

CACAGGATGA 

GGTAACGGTA 
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GTCTAAAACG 

TTGAATAATC 

CAGTCCATAC 

AGATAAAGAC 

CACTGCAACC 

GGGACCACAG 

TTTCGCCATG 

GCCTCCCAAA 

CTTAACTTTT 

GAGTGCAGTG 

TCTGCCTCAG 

TTGTATTTTT 

CCTCAAGTGG 

GCGCTCGGCC 

CCCAAATTCC 

GCTGTTCAGG 

CTGTGATCCC 

CTGTAAGAAA 

AATCAGTGAA 

CAGAATGAAG 

TATGCATAAT 

AGAAATAATT 

TTAGGAGCTC 

ATAATCAGAT 

TCATCACATA 

ATAAACTTTA 

GTGTCAATTC 

TAATATTCAT 

AGTTGGAGAC 

CACTTGAAAA 

G C AG AATAAA 

CTTCATGAAA 



TTCAATCCTG 

TGTTGATTCT 

TGGGAGATTA 

AGTTGAGTAA 

TATGGTTAAA 

GAAAAGCAGA 

CCCCAGGAAC 

TTATGTTATC 

AGATCTGGTG 

TAGGTGAATA 

GAAGGTATGG 

TCCCCCAAAA 

CCTGGCCAGG 

TGAATGTCTC 

ACC CTAATAA 

CGGATCTTTA 

AAATATGTCA 

AAGGCTGAGG 

GATAGAAGGT 

AAGAAGCCAG 

TCACTGCTCA 

CCACGACAGT 

GGGAAGAACC 

CAT T AGTTT A 

TGCTGCTCTG 

TCGCTCTGTC 

TTCACCTCCC 

GTGGGTGCCA 

TTGGCCAGGC 

GTGCTAGGAT 

TTTTGTTTTT 

GTGCGATCTT 

TCTCCTGAGG 

AGTAGAGACA 

TCTGCCTGCC 

CTTCTTTACT 

TTCTTGTGTG 

CTGGAGCAAG 

AGTAAGAGGA 

AAAAAATTAC 

CTCAAAGATA 

AAAAAGAAAT 

GGGACTCCTA 

TCTCAAAGCT 

AATGAATTCC 

TATCAAAAGA 

CAAATAGTAC 

AGATAAGCAT 

ATCAAGAAAA 

GAAACAAAAC 

ATCAATACCT 

CACCTAACCT 

CATCCTTCTG 

TAAGTCTCAA 



GCTTAGGGAA 

GTTCTCTTCC 

TCTTTGGTAA 

TAAGAAACTT 

AGCTTAATTA 

GGAGGCACCA 

TGGAAGTGGA 

TGATGTTTTT 

TGTAATAACC 

CTTGGCTATT 

AAATGTCTCC 

GAGGGCTGAT 

CACAGTGGCT 

ACCATACTGA 

TTAGACAGGA 

TCCCACTGCA 

GAGGCATTTG 

CCTGCTGGGT 

TGCACAAGGT 

CTAAAGCCCA 

TATACACTAA 

TTACAAATAC 

CTTAGTTCTG 

GCACATAATC 

CCTATGGAGT 

ACTCAGGCTG 

GGGTTCAAGC 

CCATGCCTGG 

TGGTCTCGAA 

TACAGGCATT 

TTGAGACAGA 

GGTTCACTGC 

AGCTGGGACT 

GTGTCTTGCC 

TCAGCCTCCC 

TTCTTAATAA 

AGATCCAATA 

CTGGAGCTCA 

TCATGGTCAC 

TAGAGGGCTT 

GGTCAATTGA 

AGAG CCTTAG 

GAAGGAGAAA 

TCCCATGTTT 

AAGTAGGATA 

TGAAGAAGAT 

TCAAAAGATG 

TGTTATAATA 

CATAACATTA 

TGACAGAATT 

CACTAGTTAG 

GACC CTAACA 

AAGCTCACAT 

TAAATGTAAA 



TGAGTACTTT 

CCTCCACACA 

AGTTCAAAAG 

AAAAGGACTC 

AAAGTGGATA 

CAGACCCCAT 

TAGATCCTTC 

GACTTTTGGG 

AGGTAGGAAA 

TGCACTTTTG 

ATCCCCACCG 

TCCCTCTTTT 

CACGCCTGTA 

CACTTTGTGA 

ATATCATTGC 

ATCCTTAGGA 

AATCAGAGTG 

TAGGTTAGGC 

ACCCGTCACA 

CCAAAACCAA 

TTATACTGCA 

CATGACAACA 

GGAATTGTCC 

CAGAAATAAC 

AGCCATTCTT 

GAGTCTG GAG 

AATTCTCCTG 

CTAATTTTTG 

CTCCTGGCCT 

ACCCACTATG 

GTCTCACTCT 

AACCTCTGCC 

ACAGACATGT 

ATGTTTGTCA 

AAAGTGCTGT 

ACTTGTTTTC 

ACCCTTTTGT 

TGCTGCTGCT 

TCCAGCCTGA 

TAACAGCAAA 

AATGATCTAC 

AGACAGGGGA 

AGTGAGAGGA 

GGCAAAAAAG 

CACTCAAAGA 

GAATCTTGAG 

TCTGGAGTAG 

AATAAAGAAA 

TAAACATACA 

GAAGGGAGAA 

ACAAGATCAA 

TAAATCTATA 

GAAACATTTT 

AGGACTATAA 



CTGATTGATA 

CTGTCTTGAG 

CCAGAAATAA 

CTTTTTTTTT 

TTCAATCTCT 

TTTGGGAAAA 

GCAAAATCTA 

GGTATCAGAA 

TATACTTCTG 

GATCACAAGA 

AGAGATAAGA 

GGGATCCAGG 

ATCTCAACAC 

CTGAGCTCCT 

CCCTATTTAG 

TTAAGGGTTC 

ACTCCATCTT 

ATTCTAACCA 

AAGACCTTGC 

CATGGCCACA 

TTAGCATGCT 

TCTGGACGTT 

ACCTCTTTCC 

TATACGTCTG 

TTCTTTTATT 

TGCAGTGACG 

CCTCAGCCTC 

TATTATTAGT 

CAAGCGATCC 

CATGACCCAT 

GTCACCCAGG 

TCCTGGGTTC 

GCCACTACAC 

GGCTTGTCTC 

GATTACAGGC 

ACTTTACTGT 

GTGTGAAAGA 

CAGACTGGAG 

ACGACAGCAT 

TTTGAGCAGC 

TCTGAAAAAC 

TACCATCAAG 

CAGGGAGAGA 

CATTAACTTG 

GATCCATACC 

AGCAGAAAGA 

GTATACTAAT 

GGTATTTTGT 

TGCACCTAAC 

ATAGAAAATT 

CAAAAAAATA 

GGTCACTACA 

TCAGGATAGA 

TAATAGAGTA 



TCTGTGTGAC 
TTTTCCTCTC 
TGGCCGTGTG 
TGCTTTAGAG 
AAAAGCCTGG 
CCTCTGTTTT 
AGGCTCTGTT 
ATTACTTTGC 
GGGATAGCTA 
AGCATTCTCT 
TTCCCAGGGG 
ATCTGGTATA 
TTTGGGAAGC 
CTCTACCCTG 
TCTGAAGAAG 
CCTGGTAAAA 
GAATAGGGGC 
GGAGTTTAGT 
TGATAAAATA 
AAAGTGACCT 
ACAAGACACT 
ACCTTATATG 
TGAAAAATTC 
CTTATTTGAG 
TTTATTTTTT 
TGTTTTGGCT 
CCAACTAGCT 
AGAGATGGGG 
ACTTGCCTTG 
TCTTTTATTT 
CTAGAGGCTG 
AAGCGATTCT 
CCAGCTAATT 
GAACTCCTAA 
ATAAATCACT 
ATGGACTAGC 
ATGTATTGCT 
CATGCGTGAT 
GATATCTCAT 
AAAAAGAAGT 
AGAAAGAAGA 
CATACTAATA 
GAATGTTTGG 
CATACATATT 
TAGACACATC 
AAGGAACAAT 
ATCAGACAAA 
AATGATAAAA 
AAC AG AG C C C 
CGACAATAAT 
GAAGACTTAA 
CCCCAAAACA 
CTGTATATTA 
TATATTCTCT 
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GACCAAAGTG 
TAATCCCAGC 
GCCTGACCAA 
GGTGGCATCT 
CCAGGAGGCA 
GAGCGAGACT 
AAACCCAAAG 
ATTATGTACA 
AAAGACAGTG 
AAAAACTCCC 
ATTTATCTAG 
TCTATTTCAT 
ATTGACACAG 
T AAACATC T A 
AATGATAAAA 
TAGAAATTGT 
TATTTTCACA 
GGCCGGGCAC 
TCACTTGAGG 
AAAAATACAA 
GCTGAGGCAC 
CCACTGTGCT 
AAGAAAAGTG 
TATAAGTGGT 
AGGCAACCAC 
AATAGAATGA 
GAACAAAGAA 
C CAT AT AG AT 
TGTCAGATTG 
TGCTGCCTAC 
TTTAACCTTA 
TATTTATTTA 
AGTGGCACCA 
TCAGGCCTCC 
ATTTTTAGTA 
AGCCTCCCAA 
TTCTTGCTAC 
TATTTTTCTG 
GGGGTTTGTG 
CTCAATTCAT 
TCACCCAGGC 
TCAAGCAATC 
GCCTGACTAA 
GTCTCGAGCT 
CAGGCATGAG 
AGTTGGCTTG 
TCTGTCTCTC 
AGCAGTTTCC 
CGCAGTATTA 
GGACTTTCAT 
CTACACACTG 
AGTACAATGT 
TGTTTAATTT 
TTATTTCTAC 



GAATGAAGAT 
ACTTTGGGAG 
CATGGTGAAA 
GCCTGTAGTC 
GAGATTGCAG 
CCGTCTCAAA 
CAAGCAAGAG 
GAAGAACAAG 
GAGCAAAATT 
CCCAAATGAG 
TCATATGTGA 
TTAAAAAATC 
TCATTAACTA 
ATCAGCAGAC 
GTCCCTTAGG 
AACTATGTGA 
AAAATAGTTA 
AAAGGTTCGT 
ACAGGAGTTC 
AAATTAACCA 
AAGAATCACT 
CCAGC CTAGG 
TATGACAACA 
ATACTTTTAG 
TGAAATAATG 
GAAAAAATAT 
CAGATGGGAC 
TATCACACTT 
AATTAAAAAA 
AAAAAATTCA 
TTTACTCAAA 
TTTATTTATT 
TCTAGGCTCA 
CAAGTAG CTG 
GAGACGGGGT 
AGTGCTGGGA 
GCTTCCTCCA 
GTAAAATTAC 
TCTGCTTTTG 
GAATTGAGAC 
TGGAGTGCAG 
CTTCTGCCTC 
TTTTTTAAAT 
CCTGGGCTCA 
CCTCTGTGGC 
TGGTTACAAC 
CCCACCTCTC 
AGAGAGTAGG 
CTGGGAGAAC 
TTTAGACTTT 
TTTAAGTTGC 
ATTTTGTAAT 
CCAAATATGT 
TGTAGTCAGA 



AGAAATCAAT 
GCCAAGGCGG 
CCCTGTCTCT 
CCAGCTACTC 
TG AG CTG AG A 
ATTAAAAAAA 
GAAAATGAAA 
TGTATAGATC 
TTTTAGATTA 
GGTGAAATAA 
GAGTTTTATG 
AACCGTGCAA 
AAATATTGTA 
TAGAACAATA 
CATATTGAAA 
GTAAACAGAT 
ACAATGTAAT 
GCCTGTAATC 
AAGACCAGCC 
GGCGTGGTGG 
TGAATCCAGG 
CAACAGAGCT 
ACAGTG C AAA 
ATGAACTACG 
AAACGAAGAA 
TTAAGTTGTT 
AAATGGGAAA 
AAGGTAAATG 
ACAGACAACA 
CTTTAATATA 
CCTCCTGATC 
TTTGAGACAG 
CTGCAGCCTC 
GGACTATAGG 
TTTGCCATGT 
TTACAGGCGT 
ATCCATTTTG 
TTATGGGTCA 
CCAGGAAGCT 
^"TTTTTTTTT 
CGGTGTGAAC 
ACCATTCTGT 
ATTTTTTTTA 
AGTGATCCTC 
TAGCCAAGAC 
TTATCAGGAT 
TCACATCCCT 
ATGGGATTAC 
ACAAGTATCT 
ATTTGTTTTC 
ATCCCATGAA 
TTTTTGTGAT 
GTGTTTTTTT 
TTTAATAATT 



AACTAGGCTG 
ACAGATCACG 
ACTAACAAAA 
GGGACACTGA 
TCGCGCCACT 
AAAAAGAAAC 
AATTTCAAAG 
ACATATTTCT 
ATGAAAGACC 
GACAATTTAA 
ATACATTTTG 
TTAAATGGTA 
GTATTTTTTT 
AAAAATATTT 
TTCCTATTTA 
AATATTTTTT 
GTGTGATTTA 
CCAGCACTTT 
TGGCTAACAT 
TGCACGCCTG 
AGGTGGAAGT 
AGACTCTGTC 
AGAAGTGGAA 
ATAAATTAAT 
TTATGGCTAA 
CAACAGATGG 
GTAATAGCAA 
ATCTAAATAC 
ACAAAAAAAA 
AAGACACAAA 
CCTATTTATT 
AGTCTGACTC 
TACCTCTCGG 
CACATGCCAC 
TGGCCAGGTT 
GAGCCACAGC 
TGCATTTGAT 
CTGAGGACTG 
GGGGTACCAC 
TTTTTTTTTT 
ATGGCTCACT 
ATAGCTAGGA 
GAGATGGGGC 
CCACCTTGGT 
TTTTTATTTT 
TGATGATCTC 
TGCTCTGCTG 
TTCTAGTTTA 
CTTATTAGAC 
T ACT AT AAG C 
TTTTGATGTG 
TTGTTTGGAG 
CTACATTTCT 
CATTTATTTT 



GGCGTGATGG 
AGGTCAGGAG 
TACAAAAATT 
GGCAGGAGAA 
GCATTCCAGC 
TAGAAAAATA 
CAGCCAAGAA 
CATAGACACA 
TACAATTCTG 
TACAGAGAAA 
TACTGTATAT 
GATTGTCTTG 
ATCTCCCTGC 
TTTAAAAGTC 
TACAAAGGAA 
CTCCATAAAA 
TAGCATTTAA 
TGGAGGCCGA 
GGCAAAACCC 
TAATCCCAGC 
TGCAGTGAGG 
ACACACACAC 
ATGAAAATAA 
GATGTATACT 
CAAGCCACAA 
GAAAAAAAAG 
GATGATAGAC 
TCTAATACAA 
GCAAAAAAAG 
TAGTCTAGAA 
TATTTATTTA 
TGTTGCCCAG 
GTTCAAGCGA 
CATGCCCAGC 
GGTCTCAAAC 
ACCCAGCTCC 
GATTTTGCCA 
GGATGTTCTT 
CAGTCAAGTA 
TTACGCAGAG 
GCAGCCTCAA 
CTACAGGTGT 
TCACTTTGTT 
CTCCCAAAGT 
TTAGC CTAAA 
TCTCTCTCTC 
AGAAGCAGAG 
CTTTTATCAT 
ATACCACCTT 
AATTTAAGTT 
CTTTATTGTC 
AGATTGATTA 
TATTTTTATT 
TATTATTTTC 



CTCACGCCTG 
TTTGAGACCA 
AGCCAGGCCT 
TCACTTGAAC 
CTGGGAGACA 
AGAACAAATC 
CAAAAGGCAC 
ATATAAGCAA 
TACCAAGCAA 
AGAGGAAGGA 
GTGGATGTTT 
CTTCTTTTTG 
CTAAAGG CAA 
CTTTAGGCAG 
TAAACAGTAC 
TGTGGTTGAC 
AAGTAAAACA 
GGCGTGCAGA 
CATCTCTACT 
TACTCTGGAG 
CAAAATTATA 
ACACACACAA 
TGTTATTTTA 
ATAAACTCTA 
AAAGAAATAA 
AGGAAAAAGA 
TTAACTCTAC 
AAGCAGAGGT 
AG C C ACAAC A 
CACCATCACT 
TTTATTTATT 
GCTGGAGTGC 
TTCTCCTGCC 
TAATTATTAT 
GCCTGACCTC 
TCTTCATTTA 
GTAACTTCTT 
TCTTCTAGAG 
TTACTTTAAA 
TCCTACTCTG 
CCTACTGAGC 
GTGCCACCAT 
GCCCAGGCCA 
GCTGGGGTTA 
TGTGTATAAA 
TCTCTCTCTC 
CAAACATTCT 
CCTTTGGGAT 
TGTAGAATCT 
ACAGATCTCT 
ATTATTATAT 
ATTAGAATGA 
GATTTCAAAT 
ATTTTTTTAG 
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188281 ATCTTACATG GATGGCAGCA GGCAAAGAGA GAGCTTGTGC AGGGAAACTC CCGTTTTTAA 

188341 AACCATCAGA TCTCGTGAGA CTCATTCACT ATCATAAGAA C AG C ATAGGA AAGACCCGGC 

188401 CCATAATTCA GTCACCTCCC ACTGGGTTCC TCCCAGGACA CATGGGAATT GTGGGAGTTA 

188461 CAATTCAAGA TGAGATTTGG GTAGGGACAC AGCCAAACCA TATAAATAAC TAATCATCAG 

188521 GGAAATGCAA ATCAAAACCA CAATAAGGTA TCATCTCACC CCAGTTAGAA TGGCTATTGT 

18 8 581 CAAAAAAACA AAAAATAACA AATG CTGGTG AGGATGTACA GAAGAGGGGA CTCTTATGTC 

188641 CCACTGGTGG AAATGTCAAT TAGCATAGCC ATTATG CAAA ATAGTATGGA AGTGAGGTAG 

1887 01 GTTACATAGG GTGGTCACAG CCTCCCTTGA AAGGAAACAA GAAACTTGTC AAATTGATGG 

188761 AGAGAACAAA TCTCTTGACA TTACACAAAC TGCATCTGGG GCTAGTGGTT AGAATATCCT 

188821 CAGTCAAGGA GGTAGAAGAG CAGGAGGGAA AATC CCTAAG TTCGTGCAAG TGCAGAAACC 

188 881 C ACAAG CTGT GTTCTCAGGT TGACATATAC TCATTTTAAT AGTAAGAAAC ACACCCTTGG 

188 941 GTAGAGAATT AAAATGCTAA TAATACATGT GATGTATGTA CTAGCGTGTA TGGCAATATT 

18 9001 GCATGCACAT TCAAG AG AC C ACCCAAAACA TATTTAACAA CAATGCCCAT TCCCACCCCC 

18 9061 TCATGGATAA TCACGTAGGA CTCCCATAAC GGGAGTTTCT TCAGTGTCAA TTGGTGCTGA 

189121 AGTAGCCGAC CCTGACTCTG CTATCAGCGT GTACTTTCAC CTTGCAATAA ACTCCTTTGC 

189181 CTACTTTTAC TTTGGACTGG CTTTCAAATT CTTTTGTGCA GGGAATTCAA GAATCTGAAC 

18 9241 CAGCCTACTG ACAACAGAGG TTTCTCAGAA ACCTAAAAAT AGATCTACCA GATGAGGCTG 

189301 AAAATCTGCT ACTGGCTATT TATCCAAAGG GAAGGAAATC AGTATACAAA GAGACACCTA 

18 9361 CATCCCCATG TTTATTGCGT CACTCTTCAC AAGAG CTGAT ATATAGAGTC AACCCTAAAT 

18 9421 GTTCATTAAC AGACAAATGG ATAGAAAATG TGGCATATAT ACACAATGAA ATACTATTTG 

189481 GCCATGAGAA GAATGCAATC TTGTCATTTG TGGCAACGTA GATGAAACTG GAGAACATTA 

189541 TGTTAAGTAA GATAAGCTAG GATTGGAAAG ATAAATACTA CATGTTATCA CTCATATGTG 

1896 01 AAAGTAGAGA AAAATTTTTA GCTCATGGAT TTAGAGAACA GAACTGTGGG TACCGGAAGC 

189661 TGGGAAGGGT AGCAAGGAGG GGAGGATAGG GAGAGGTTGG TTAATGGTGA CAAAATTACA 

18 9721 GCTAGATTGT AGAAATGAGT TCCGGTGTTC TGCACCATTG TAGGGTGCAT ATGGTTAACT 

189781 CTCATTTATT GTATATTTTC AAAAAGCTAG AAAAGAATTT TGAATACTCA CAACAAAATA 

189841 AATGATAAAT GTTTAAGGTG ATGGATATAC TAATTACTCT GATTTGATTA TTACACATTG 

189901 TGTACACATA TAAAAATATC ACTCTTTATC CCGTATATAT GTACAGTTAT TATATGTCAA 

18 9961 CTAAAAATAA AAGAAAAAAA GAATATGATC TATCATGATG TATATATCAT GTGTACTTGA 

190021 GCAAAATGTG CATGCAGATA TTGTGTATAA TGTTCTATAA ATCAATTAGC TCAAGATAAT 

190081 AGATAGGATT GTTCAGATCT TCTGTGTCTT TACTGATATT TTGTCTAGTT ATTGCATCAT 

190141 TACCAAAAAA AGGGTGTTAA ACTCTCCAAA TGTGATTGTA GAATTGTCTA TTTTGTCTTT 

190201 TCTTTTCCAT TTTTACTTTA TGTATTTTGA AACTCTGTTA TGACATTTTG CTATGTATTT 

190261 TAAAACTTCG TTATGTATTT TGAAACTCTG TTGTTAGAAT CATACATTTA TGATTATTAT 

190321 GTTTTCTTGA TGAAATGACA CTTTTCTATT GTCATTGTTT TTGTTTTTTC TGAAATGGAG 

190381 TCTCACTCTG TTGCCCAGGC TGGAGTACAG TGGCACAATC TTGGTTCACT GCAACCTCCA 

190441 CCTCCTGGGT TCAAG CGAGT CTCCTGACTC AGCCTCCAAG TAG CTGGGAT TACAGGCATG 

190501 TGCCAGCATG CCAAACTAAT TTTGTATTTT TATTAGAGAC AGAGTTTCAC CACGTTGGCC 

190561 AGGCTGGTCT CGAACCTCTG ACCTCAGGTG ATCCGCCCAC CTCGGCATTT TTATTTTATT 

190621 TTATTTTTTT GAGACAGAGT CTCACTCTGT CACCCAGGGT AGAATGCGGT GGTGTGATCT 

190681 TGGCTCACTG CAACCTCCGC CTCCTGGGTT CAAGCAATTC CCATGCCTCA GCCTCCCGAG 

190741 TAGCTGGGAT TACAGGCACA TACCACCATG ACTGGCTAAT TTTTGTATTT TTAGTAGAGA 

190801 TGGGGTTTTT CTATGTTGGC CAGGCTGGCA ACTGACTCCT TTAACAATAC AAAATATCAC 

190861 TCTGTCTCTG GTAACACTCT CTGTCTTAAA CTCTATTTTA GCTGTTATTA TTATAGC CAT 

190 921 TTTAGTCTTT TTATGCTTTC TGTTTGCATA GTGTATATAT TTTAATATGT TTATTCTCAA 

190981 GTTATCTGTG TTTTTATATT TAAGATGTTT CTCTTCTAGC CAACGTGTTT GGTTCTTGCA 

191041 TTTTTAAGTC GATTCTAACA ATCTTTGCCT TTCAATTGAA ATATTTACAC CATTAACATC 

191101 TAACATTAAC ATTTATTTTT CTTTCCACAG TACACTGGCT AGCATCTCCC ATATAATATT 
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191161 GAACATAAAG TGTGATAACT GACATCCTTA TTTCATTCCT ACTCTGAGTG GAAAGGGCAG 

191221 GGGTGGAGAA AGCATTCAAC AATTTGCCAT AATTATAATG CTTTTTGTTA CACTGTTTTC 

191281 TTCTGCATTA AAAAATATCA TTACATTTTG CATGAATTAT TAGGAGAAAA TATTTTC CAA 

191341 TTTTCCTGGA AAATGCCATA ACCACGTCTC TCAATTTTGT TTCCATCTTT CTTCCACATT 

191401 TTACATAACC TACATAAGAG ACACATTATC AAGTATATTT TACATGGCTT CTCAGTGTCT 

191461 TCTCTGTCTG CTAACAGGTT TACCAAGAGA TGGCACTCTT GTATTTCTGG TGGCTATGTC 

191521 CATATCGTTT TGCCTTTAAG ACAGCGTAAC TACTTCTTTC ACCAGTATTA AAGACATGTA 

191581 CATTTGATCT GGTTCTTGTG GATGATTTTA AATGACTCAA GCTAATAATC CTAATTTTAC 

191641 CTAAACACTC CATTATTTTA AAATGTATTC CTTTATGCCC ACAATAAACA TTTATTGACA 

1917 01 TTAGGCTGGA CATTAGGCTT CTCTATGGCA GACATTAGGC TGGACCCTAG CCATATATCT 

191761 ATTGAGGGAA AAAAAATTAT TTTCTATATA AGTTTCCAGA AAGCCAAGAT GTGTTTTAAA 

191821 AACAAAACAA AACATTACAT TCTAAATGCT GTAACAAGAT AAGAAAAAGT GTTGAGGCTG 

191881 AGAGAAGAAC AAAGCAGCAA GCAACTCCTG GAAGGACCAC TGCTGCAGAG GTAATAACTG 

191941 GTGAACCATG TTTTGGAGAA GGAAAAGGTC ACCAAGAGAA GGAGGGGGTC CAGGGTGTTC 

192 0 01 AGAAAGATTG CATGCATAAA GATCAAGGGT AATAAAAAAA ATTCCGTATT ATGTAAATGT 

192 061 GAAGTTCCAG GACCATGAGC TTGGAGAGCA TGAAGTACAG GAGGAGGGTT GGTTTCAAAT 

192121 AAATCTGGGA ATGAAACAGT GAAGCCTCTG GCAGAACTCA CATCTCTTTC CTCCCCTCTT 

192181 CCTTGCACAT TCCCTTTATG GAGTAATTGC AGGGATGGGA AAAGTTCAAA ACCACCACTG 

192241 AGCCTAGGAA GTGCTAGGGT AAAGTGGAGA ATGAACCTGC GTGATTTGCT CATCCTAAAC 

192301 TAGGTTCTTC TAGGAGAGCC CTTCCCCATA AAATCTGCCC TCCTCGAAGG GGCCCAGACA 

192361 GCCTAAGCTC ACCTCCCAAA GACCCCTTAC TTGCTGACTG AATCTGATTC CACCCAGACA 

1924 21 TGGCCTAAAA CCCTTCCATA ACTCTATAGC CAAATTCAAT TTTAGACAGG CCTCATACCA 

1924 81 ACCTTTCTTC CTCTAAGTCT GCCACCCTAG GCAATTCTCA ACATTCTCTA CACACTTTGG 

192 541 GGCCATAGAC GTGCTACCAA GTCTCCAGAC CTAGACCTGA TGGAGCAGTG CTGTAATGAG 

192 6 01 ACGACCACTG GCCTTTGAAC CAGACCCTTC TCTGTGGCTC CTATGCATCT CCAACCTGTT 

192661 TTGAGCACTG CTGCCAAGAC ATCTTTGGCA CTTTGTTGTG AAGTTTTAAA ACTGAACTAA 

192721 TCTACAAAAC ACCTAACCTT TAAAAATTCA TTGTCATTTC ATATCATGAA AGATAAAGAA 

192 7 81 AGGCCAGGAA ACTGTTCCAG GTTAATAGAG ACTAAAGAGA TAGCAACCAA ATG CAATTTG 

192 841 TGATC CTGG A TTGAGGGGAA AAAGTGTTGT CAGAGACATG ATTGGGACAG CTGGTAAAAT 

192 901 TTGAATTTGA ATTTAAAGAT AAAGTATTGA GTAATATAGG AAGATGATTA TCTGCAACTT 

192 961 TCAAATGTTT CAGTAAGTAT ATATATATAT AAAGAGATAT AAAGACATAT AAATAAATGG 

193 021 ATAGGTAGAG AAAAAGCAAA TGTATAATAT TAACAATCTA GGTAAAAAG T ATATGAGTGT 
193 081 TCTTTGTACT GTTTTTCTGA TTTTTCTATA TGTTTGAAAT CATTTTAAAA TAAGAAGGTT 
193141 TTTGGGTTTT TTTTGTTTGT TTTTTGTTTT TAG AG AC AG C ATCTTATTCT GTCACCAGGC 
193201 TGTAGCTCAG TGGCCCAATC ATTGCTCACT GCAGCCTCAA CTTCCTGGGC TCCAGTAATT 

1932 61 CCCCCTACCT CAGGCTCATG AGTAGCTGGT ACTTCAGGTG TGCACCACTG CACTCAGCTA 
193321 ATTTTTATTT TTTAAATTTT TGTAGAGATG GCATGTTGCT ATGTCACCCA GGCTAGTCTC 

1933 81 AAACTCCTGC CCCCAAGTGA TCCTCCCACT TTGGCCTCCC AAAGTGCTAG AATTATAGGC 
193441 ATGAGCCACT GCACCCAGCC CCAAATAAAA AAGTATTTTA TTTTAATTAA CTAATTAACT 
193 501 TTGAGTCAGA GTTTCACCCT TGTCACCCAG GCTGGAGTGC AATGGCATGA TGTTGGCTCA 
193 561 CTGCAAACTC TGCCTCCTGT GTTTAAGCGA TTCTCTTGCC TCAGACTCCT GAGTAGCTGA 
193621 GATTACAGGT GCCTGCCACC ATGCCCAGCT AATTTTTATA TTTTTAGTAG AGACGGGGTT 
1936 81 TCAGCATGTT GGTCAAGCTT GTCTCAAACT CCTGACCTCA GGTGATCCAC CCACCTCCGC 
193741 CTCCGAAAGT GTTGATGAGC CACCACACCC GGTCTAAAAA GTATTTTAAA ACCACAGTCC 
1938 01 CACTCTACCT TGTCCTACAC TACCAGGGGC TAGGATCACC CCATGTCTTC TAGGCTATGA 
193861 GATAGAGGAA TCCAAGGAAG AAGATAAGCT ACTTGGTTCC TCTATAGGGT CTTGTGTGTG 
193 921 CTCTCATGTG CTCTCTCTCT CTCTCTCTCT CTCACACACA CACACACACA CACACACACA 

193 981 CACATGAATA CCAGAGCTAT CACTTTCCCA GTCTAGTACT CATCTCATCC CAAGGGTTTT 

194 041 GTGTTGTAGT GGTTTGCTCA TTTCTTTGTT TTGTTTGTTT GCTTGGATTA TTCTTTTTCT 
194101 CTTTTTGCAG CTGAAGGGAG AATTTCCAGG CCAGCCCTTT GGCCATTAGA GTTACAGTGC 
194161 CTCTATTCAG GCTTCATAGA GAGACCTGGG ATTCAGTAGT GGGGGGCTTT TATCCAGTTC 
194221 AAAATAATGC ATTCTCACCA AGATGTACTT TGAAATAAAA CAATACTAAA ACACAAAATT 
1942 81 TTATTTATGC TGAACATTGA ATCACTTTTT TCTGTATTTT GTGTAGAAAG TTATACACAC 
194341 ACAAACACAT TTGCTCCTGC TTTGTTTATT GGCCCAGGGG TATGTTTGGT AATACTTCAT 
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TCTTTCCATT 
AAATTCTTAT 
CAGTCATGTT 
AAGGTGCTTC 
GTTCTGCCTG 
TTCTAGTCAC 
GTCTCTGTGT 
AGATCTTAAT 
TCAAAAAAAA 
TTGGGAGGCC 
ATGGTGAAAC 
TGTAATCCCA 
ATTGCAATGA 
TCAAAATAAT 
CTCTGATTGG 
TACAGGATGT 
TCAACCCATC 
CACACCCCCA 
C C AC ATGTTC 
TGTTCCTGCC 
ACTTTTCTTC 



AAACAGTCTA 
ATCATTTAAT 
AAGAAAACCT 
ACTATTTCAT 
AATGAAAATT 
TTTAAAATGT 
TTTCCTGTGA 
TTGAGGAAAT 
GTCATGACCT 
TGGTTTGGAT 
CAGTAAATAA 
AGTCAGATTC 
TATCTCTCTC 
TGTATATATC 
TACAGTACTA 
GTATATATAC 
ATGTATCCTA 
CATAGTTTAT 
T TTC* TT1* T < T T 
CACCTCACTG 
TAGCTGGTAC 
GACGAGGTTT 
TGCCTCAGAT 
TACACATCTT 
TATTATGGCT 
AATATTCAGA 
AAATGTATTA 
TTAACTGTGT 
GAAAATTTGG 
TCAATGTGTT 
AAGAATATAG 
TTTCTGCCCT 
TATAATTCTT 
TCTATTCCTG 
CTTTCCAGTC 
GAACACTTCT 
CCCTGTATTA 
GGAAGCCTTC 
CTGTGCAAGT 
GTGGTCTCAT 
AAATTAATAT 
ATATAGTATG 
GAGGCAGGAG 
CTCATCTCTA 
GCTACTCAGG 
GCTGAGATCA 
AATAATAATA 
AAATAGCTGT 
GCAGGTTTGT 
ACCTAGGTAT 
CCCCATCCTC 
TCATTGTTCA 
TTAGCTGTTA 
TACATTACTC 
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TTGATGTTTA 

ATTTAACTGA 

GCTACAGAGA 

TAATGTTTAT 

AACTGGGTGA 

TAAGATTAAG 

AGTCCTAACA 

ATCCTAGTGA 

AATATTATCA 

GATCATAGCA 

AAAATGAACA 

AACAAATTAT 

GGCCACTCTT 

ACTTTTTCTA 

CATACCTTTT 

TCTTTCTCTC 

TCTTTCTTTC 

GTGCAGTGGT 

TGAGTAG CTG 

GGAGACGGGA 

TCCTGCCTCA 

ATAACTATTT 

TTTTAAATAA 

AAGGGTAGGC 

ATATATTACC 

TGGGCAGATT 

GAGACAGGCA 

TAAGGTGGAG 

AAGAAATGGA 

TTGGCTCAGT 

TTTGTGTTGT 

TTAAATACAG 

AGTATGGTGA 

CAGAGAGGGA 

AGCAACCACA 

CAAGCAGAAA 

GCAGCTTCCT 

AAATGCATAT 

TTTATTGAGG 

TACAGTAACA 

TGTCCAAGCT 

AGGAGCCCTG 

ACTGAATGGT 

CTAACAGTGA 

TCTGAGAAAG 

CTGTAAAAAT 

C AAGTTAGCC 

TCTTCTTTTT 

CAGTTTTAGG 

ATATTGCCTA 

CCTAGGGCAA 

TCCAAGGACG 

GGAATGAAAT 

TTTCAGACTT 



TAAATGTTAC 

GTTAACTTTG 

TCTTGATTGT 

TAAGCGTGTA 

CTTATAAACA 

GCCCTGGCAA 

TGGCAGAAGG 

TGAGGTTTCT 

CTTTGTGGGT 

CACACCATAG 

AGATCCCCTC 

GCAGTTTGAA 

TCCTTATTAT 

CCAGAACTGG 

GGGTAAATTA 

TTTCTCTTTC 

TTTCTTTCTT 

GCAATCTCAG 

GGACTATAGG 

TCTCCCTATG 

GCCTCCCAAA 

TCATTGGCTT 

AGAAAGGAAG 

TGAATGTTGT 

TTATATGACA 

TTCCTGAATT 

GAAGAGTCAG 

GAAAGGCCAA 

TTTTCCCCTA 

GAAACCCATT 

TTCAAGCCAT 

AGATCTGAGG 

GACTCACTAG 

AGGCTCTCAT 

GTCTCAAAAT 

ATGGATTAGG 

GGGAAGTTGC 

GGAAAATCTA 

AGCTACTACC 

CAATCCTTGC 

CAGGGAAAAC 

CCCTTTCCTC 

TGTATGCACA 

GAACTTGAAA 

TGGGGCCAAA 

CTGCAAAAGT 

TTATAGTCTA 

GGGTAGAAGG 

AGAAGTCAGA 

ATTCCAAAAT 

GGCTTGAGAA 

CTGTGAGTAT 

ACTGATCAGA 

CTATATGAAT 



AACTTCTTTA 

TTATATGAAA 

TGGTGGTGAA 

CTGTCTTAGT 

ACAAAAAATT 

ATTTAGTGTC 

GTTGAATAAA 

GCCCTCATGG 

TAGGATTTCA 

GACAGACACT 

AG AGAG CTTG 

AAATCTACTC 

ATTAAATATT 

TATCAGCTCA 

AG CCAAGAAA 

TTTCTTTCTC 

TTTCTTTCTG 

CTCACTGCAG 

CATGTGCCAC 

TTGCTAAGGC 

GTC CTGGG AT 

ATCAGGCACA 

GAATTGTTTC 

CCTCCAAAGA 

AAAGGGACTT 

TTGCAGATGG 

AATAAGAGAA 

GAG CCAAAAA 

AAGCCTCTGG 

TTGGACTTCT 

CACAGTTGTG 

AGTTGAGTAG 

GATGGCGGAA 

TTCATTTTAT 

TAATGATTAC 

GAAGACATGG 

CAGGGCAGTC 

GCTGACTTAA 

AATTAGAATA 

TTTACTAAAG 

ACTAGGAAGT 

CACACCATGC 

GTTCAAAGCA 

ATGAAGTTAA 

CCTGTTGCCG 

ATTTGAAAGG 

GGGCTTAAAA 

ATTATTACTA 

GAAAAGACAT 

GAAGAGACTC 

GCCACTGGTA 

TAACTGTGAT 

ACTGACAAGA 

TTGAAATGGT 



ATTTCATTTA 

ATCATGATTG 

GCAATGCAAG 

CTGTTCAGAC 

TATTTCTTAC 

TGGTGAGGAC 

CTTCCTTGGG 

TATAACTACT 

ACATGAGTTT 

GTGCCAAGAA 

CAAAATCCAG 

TGAATCTTAC 

TACTCTTGTT 

TGCTCTGCCT 

GTTCTCCTTT 

TCTCTCTCTT 

ACAGGGTCTT 

CCTTGAACTC 

AACATCAAGC 

TGGTCTTGGA 

TACAGGCATG 

TGATAACTAT 

AACTCTTCCT 

TATCCATGTC 

TACATGTTTA 

GCCCTAGTGT 

AAATACTTCA 

ATGCAGTGGT 

AGGGGGCACA 

GACCTTTAGA 

GTAATTTACT 

GATAAGCCTA 

CTCAATTAAG 

AAGGGTTGCG 

AAATAGGACA 

ATGATATGAA 

ACAGTTCACA 

CTGAACTCCT 

TGTATTTCAT 

CGGAAGCCAA 

GAATATGGGT 

CCCCTTGCTT 

GAAACACACG 

AAATTAAGCG 

TCTGACTGCC 

GAAGAAGGGA 

TACTGGTTTA 

ACTTACCAAA 

TAACAGCAAC 

TCTGAAAAAG 

CCAATGGACA 

GCTGTGATTA 

TTTGTGTTTG 

CTCTCAGGAA 



ATGTATACCT 

GGAGTGAGGG 

AATTCATTCA 

TGCTGTAACA 

AGTTCTGGAG 

AGGTAGCCAT 

TTTCTTTTAT 

GCCCAAAGAC 

TGAGAGGATA 

TTGTGGATAT 

CTATAAAATT 

TTGTGGCATT 

TGGGGGATCC 

TATGCAAATT 

CTTCTCTTTC 

TCTTTCTTTC 

GCTCTATTGC 

CAGGGCTCAA 

TAATTTTTGC 

TTCCTGGGCT 

AGCCACTGCC 

AATAAATCAA 

GCTACCCCTC 

CTAATCCCCA 

ATAAGTTAAG 

AATCACAAGG 

AGATGTTACA 

CACTACAAGC 

ACCTTGCCAA 

ATTGTAAATA 

ACAACAGCAA 

CTCCAGCAGG 

GAAGTCTGAA 

TCACACTAGG 

CAATTCCAAG 

ACAGGAAGGA 

TTCATTAGGC 

GAAGAGGAAT 

TTGTTCAATA 

TTCAAAGAGG 

CTG AC TC CAT 

TCAGAAAAAA 

ATGACATCTT 

GCAAAACCAA 

ACGTGGCTCA 

CAGAAAACTC 

ATGGTGAAGG 

GGTCCATTAA 

ATAAGGATCT 

ATAACTGATT 

CTGTGGACAA 

GTCAGACTGG 

GGACTGTGGC 

AAGGAGAACA 



TATTGAGTTG 

GGTTAAACCA 

TTCAGTAAAC 

AAATATCATA 

GTGGGAAGTC 

CTTTTTGCTG 

AAGGACACTA 

CCCTCCTTCT 

CAGACATTTG 

AGTGATTCTC 

ATGCTTTTTA 

GAATACTTTC 

AGTCTCACCT 

AAGAAAATAT 

TCTCTTTCTT 

TTTCTTTCTT 

CTAGGCTGGA 

GCAATCCTCC 

ATTTTTTTGT 

TATGCGATTC 

CCTGGCCATT 

TAACCAGAAT 

TATCCCTCAA 

GAACCTGTAA 

AATTTTGAGA 

GTCCTTATAA 

CTGCTGGCTT 

TGAAAAGAAA 

TACCTTGATT 

AATAAATAAT 

TAAAATAGAA 

TTATTTCGGG 

GCTGATAAGC 

AAGATCCAAT 

AGTCGGGAGC 

GGGGTACAAG 

TGTGGGCACC 

G.AACACCTCA 

ACCCCATGAG 

TTCAGTGACT 

CACTGATTTC 

AGGCTTGTTG 

TTGAGATACT 

GCCGAGGCTT 

CTATTTATCC 

CCTCCTTTTC 

TAAGTGCTTT 

GGGGAGGGAA 

CCATCTGGTA 

CAATGAAGAC 

TGGTCATTTC 

GATTGGCTGT 

TAACGAGTCT 

TGGCCGGGCC 
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TGGTGGCTCA 
GTCAGGAGTT 
AAAATTAGCA 
ATTCAGAAGG 
AGATCATACC 
TAATAATAAA 
TTGTACGCTT 
CCAGGGACAT 
TCAGTTTTTC 
ATCTTGAGGA 
AACAACCAAT 
TGGATCTGAG 
TATCTAGAAT 
TGTTTTCACT 
GAAAGCCCTA 
GAATTCAGTC 
TTTACTCTGA 
TAGGAGAATC 
AAAAAGGGAA 
TTTTTCTCAA 
TAACTTAATT 
TTTTTTTTTA 
ATTGTTTGCA 
GTGTTCAGCC 
ACCATGAAAG 
CATGCAAAGG 
GCATTTGAAA 
AGAATGGGAA 
AAGGAGAAGT 
TTCATTATCA 
CTCCTTTATA 
ATTACGAAAA 
GATCTGGTGC 
AATTTTCTTA 
TCAGTGTTTA 
CAGCTGCATG 
TGAAATCAAA 
TCTGAGTGTA 
CTTCTAAGGC 
TGGGCCTCCA 
ACACCCAAAA 
GAAAGCAGAA 
TTTAGCTACT 
ATATAGAAAG 
ACTGGCACTT 
TTTGTACACT 
AAATAGCTTC 
ATTTATCTCT 
TTTTCATCTA 
TAAAGTACGG 
AAAATAGAGG 
TGTCCCCCAA 
GTTCAAGCAA 
ACACCCGGCT 



CGCCTGTAAT 
TGAGACCAGC 
GGGCGTAGCG 
CTGAGGCAGG 
ACTGCACTCC 
AGAAAAGGAG 
GTAGAGATCA 
TGTGGTCAGC 
TAACACTGAA 
TTCCAGGCCA 
CAAAGGCACT 
AGGGCTAGAC 
ATTATACAAC 
CCAAACTGCC 
AGAGTTTTTG 
CTTGAATATC 
GTAAACTAGG 
TAAGAGCAAT 
AAACCTGTTT 
ATTCAACCTA 
TCAATTCATT 
GAGGTATATC 
AATAGGTTAA 
TCCATAATTT 
ACCTCAGTTT 
AAAATAGGAA 
CCATAAAACT 
GAAAAACTAA 
AGTTTTGTTG 
AACTTCCTTC 
ATACCTCAAA 
CAAAGGGGAA 
TGGCTCTGTG 
TCAATAAAAT 
TAAGATTGGA 
TACAGTTAAA 
TAGAAAATGG 
GAGCAATGAG 
AGAGCAACAA 
TTTGGGTCCA 
TTCAAAATTT 
AACTATACTT 
TCTGAATATA 
TCACAAAGAA 
GTGTGAATCA 
AGAGTACTAG 
CATCACAATA 
AACATTATGC 
TGCCTTTCTT 
ACAAGTCTTT 
TTTGATATAT 
GCTGTAGTGC 
TTATCCTGCC 
GTTTTTGTAT 



CCCAGCACTT 
CTGGCCAACA 
GCGCGTGCAC 
AGAATTGCTT 
AGCCTAGGTG 
AACATGACCA 
CCTAGCTTGT 
ACCAATTTCT 
GGAATTGCTT 
GAAGAATCCA 
TTTATAGAAA 
TGCCAATATT 
ATATAACACT 
CAGTGAAGTA 
AAACTTTACC 
CTCCCACTGC 
ACTCTGGGCT 
CTAACGACCA 
TTTTGCTTAT 
TACTGTGTCA 
TTTCTGTGTC 
TCCTTAAAGG 
TAATTATTCC 
TATTAGCTTC 
AGGATGGTTT 
ACGAAGAACT 
TAAGTAGGGC 
AGTGATGGGT 
TGTTTATGTT 
TAGAATAAAT 
ATGGAG CTAC 
GTTGTAATAG 
CATCATGTGG 
AAACAGTTGG 
GAAGTTGGTG 
GAGTTGCCTT 
TCCAAACTCT 
CTGAACTCCT 
CCAGTATCTG 
TTGTACAAAA 
AGATGGAGAG 
GCCATGCTGA 
AAATGAGTAT 
TGAAGTGATC 
GTTTCTATTC 
CATTTTTCTA 
ATCTATCAAA 
AGTCATGAGT 
TATAATCCTT 
TAAATTTTGT 
GTGTTTTTTT 
AGTGGCACGA 
TCAGCCTCCT 
TTTTAGTAGA 



TGGCAGGCTG 
TGGTGAAACC 
CTATGCGCAT 
GAACCCAGGA 
ACAGAGTAAG 
AAGTTATGAA 
TGCCCTCATT 
CAGAAAGATA 
CTATGTTTTC 
CTTTAAAAAA 
TACATTTCAT 
GTGACTGTTT 
TTGCAATTTA 
ACATTATCCC 
TGGTTTACTC 
AGGTTCATGC 
AACAGAGATG 
TTATAATAAA 
GCGTATACCA 
AGTAATTTTT 
TACTTACAAC 
GAAGGGTTCT 
AGTGTCTCTA 
TGTGCTTATT 
GGTATGTTAG 
GAAATTACCT 
TTTTCATCCT 
TTGTGATACA 
TAATATTCAA 
GATTAAAACT 
CCCATTGAGT 
GTAGAACAAG 
TTTCAGGCAA 
GCCAGAGGAT 
GGAAAGCTTT 
CAGCCAAGCC 
GGGTTTGACC 
GATATCCAGA 
TCCTGGTGCT 
CAACAACAAC 
ATACTATTCC 
TGAAGTCCAA 
CTACTAATTA 
ATCCTGTTTT 
CTGTATGTGG 
ATGTAATTCA 
TTGACTTGCC 
AATACTACAA 
CATCCTAAGG 
GTGCAAAAAC 
GTTTGTTTGT 
TCTTGGCTCA 
TAGTAGCAGG 
GATGGGGTTT 



AGGCGGGCAG 
CTGTCTCCAC 
GCATAGTGCG 
CGTAGAGGTT 
ACTCTGTCTC 
TAAGACTGAA 
GTACAGCTAA 
GGCAGATGAT 
TGGTGAACTC 
GAAACATTTA 
TTGCTGTAGG 
ATTATTATTG 
CGAGGCATGT 
AATTCTTCCT 
AATTTGGGAA 
TCTTTGATCT 
AAGCAAGACA 
ATCATGAGTT 
TAATATTTAC 
TTTAATATAA 
TTTGG CACTA 
GACACTGTTA 
AGTACATATC 
TTGGAAAAAC 
CCTGATTTCT 
ATTGATACAA 
TTCTCGTTAG 
ATTCCAGTAA 
AGCTCAACCT 
TGATTTAAAA 
TTTAAGCTTG 
CAGTAGTCTA 
CTTTTCAAAT 
CTCTGAGTCT 
AAGTGGAGTG 
ACGGGATCTT 
ACAGATGACT 
TGTTAGCAAG 
GACCTGATCT 
AACAACAATA 
CAGAATTCTA 
TTATTGCTCT 
TTTACAAAAT 
GTAACCCAGA 
ATGTGCACAG 
ATATTGTCGA 
AGACTCTCAT 
AGGATATTTT 
TCACAGATTA 
AGTGCAAAGC 
TTTGAGACGG 
CTGCAACCTT 
GTCTACAGGC 
CACCATGTTG 



ATCACTTGAG 
TAAAAATACA 
CGTGCCAGCT 
GCAGTAGTTG 
AAAAAAATAA 
GGCAAGAAAA 
GAAAAGGCAC 
GAGAGGGCCC 
CTCCCCACTC 
AAACCAATTT 
CCTGTATTTA 
CTGTTGCTAG 
CTCATACTTT 
ATGAAACAGT 
TGGCAGAGCA 
AGGTGTAACA 
GGCTGGATAT 
CTAGACTTAA 
ATTATTTATT 
CATTTTCCTT 
GAATTCACAA 
CATGTTCTCA 
AACCATGCCA 
ATTTCCCATT 
GCATTCGTCT 
AATCAAAGTA 
ACAGCAACAG 
CATAAAGAGC 
AAAAGTATTT 
TATACAAATT 
TGATTAAAAT 
GGCATTAGGG 
TTTCTACGCA 
CTTTCAGCTT 
TAAGTAATTG 
GCATAAAAAG 
TCAGCTAGGA 
ACTTGGAGGC 
TACT AG C AAT 
AAATCTCCAA 
GAGATATTTG 
TTTAAATACA 
CACTTGGTAA 
AATAGTCATT 
CGTATCCTGC 
AAACATTTTA 
TATTAGGTTA 
TGGACACAAT 
TGAATATCTT 
CTTGAATGAT 
ATTCCTGCTC 
TGCCTCTTGG 
ATGTGCCACC 
GCCAGGATGA 
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TCTCGAACAC 

AGGTGTGAGC 

GTCTTGAAGG 

AATGGAAAAT 

ATCCCCAAAG 

CCATGAATTC 

CCTGTTCATT 

ATTGTGTGTA 

GTTATGTAAT 

GGGGAGAGAA 

CTTGAGCTTA 

TTCGCATGCT 

TGCACTAAGA 

TTAAATGACT 

CCTGTAACCA 

GTCAAAACTT 

TTATGTTTTG 

GGCACAGGGA 

GAAGCCTGCC 

TGAAGCTTAT 

TCCCAGTGTA 

ATGAAATCAT 

GAATAAGTTT 

CAGGCTGGAG 

TGATTCTCCT 

GATGAATTTT 

ATCTCCTGAC 

GGCCACCACG 

TAATATTCTC 

AAATAATTAT 

TTTCTTCCAT 

CTTTTCTTCA 

TTGGTCTGTG 

TATATTGAAT 

TTAATTTTTT 

GACATGATCT 

GCCTCCCGAG 

TTAGTAGAGA 

GATCCACCTG 

CGTAGTTTTT 

CAATAACATT 

CCATCTCTCA 

GCTCGTTATG 

ATTCACTTAC 

ACTGTCATCC 

TTTCTCTATT 

GTTATAAGAT 

GTAGTGCTGG 

TATCAGATCG 

ATACATCTGT 

ATCATGCACT 

ATCGCCATGG 

GTTG CAGATG 

ATGATGGAAA 



CTGACCTCAA 

CACTGCACCC 

ATTTTAAGCA 

TGATGATATT 

ATAATTTGAT 

TCTTTGCCTG 

ATAATTTTCT 

GCAAACATGA 

GTCATCCTTC 

ACACCATCAG 

GTTTAATTGT 

TCTCTGTGGT 

AAGGAGCCTT 

AGCTCTAGGT 

AAAG CAT AC C 

TTACTGATGC 

GATTATGTTC 

AACTCCACCA 

ACCAGGAAAG 

TCCGACACAT 

ACATTGGAGC 

CTGTGAGAAA 

TGGGAAAGTC 

TGCAGTGGTG 

GCCTCAGCCT 

TGTATTTTTA 

CTCGTGATCC 

CCTGGCCCGG 

ACCAAGCGGC 

TTCCCAGACC 

TATATTTTTA 

AATAAGCAGG 

GTTGTTCCAT 

ATCTTCCAGC 

T T *T TT T T 

TGGCTCATTG 

TAGCTGGGAT 

TGGGGTTTCA 

CCTTAGCCTC 

TC CAAGTAG A 
AATGTATTAA 
TATAAACTCT 
AGCCCAGGGA 
CCATTCACTT 
TATCATGTGT 
CCCATATTCG 
CTATTTTGGT 
AGCTGTGAGA 
GTTGTTG TC T 
TCTCAAACTT 
TGAACACCAC 
CCTTCAATAA 
ATAGGGCTCT 



GTGATCCACC 

GGCCGATACA 

ATTTAAAAAA 

CTTAGGATAT 

TAGTATACAA 

TCACAATAGC 

AGGTGATATG 

GAATAAACAT 

ATGTTCGAAG 

AATGAAAGGA 

CCTGTCTTAT 

TTGATAAAGT 

ATCTTTATTG 

CACACAGCTG 

AGTAGTGCCC 

TAAGAGGAGG 

TCTCCATAGA 

CAAAGCGTGG 

GTAAAGCCAC 

TTACACATCT 

CAGCTCCAGC 

TTAAGCCAAA 

TTTTTTTTTT 

CGATCTCGGC 

CCCGAGTAGC 

GTAGAGATGG 

ACCGGCCTCG 

GAAAGTCATT 

TGGCTCTTTC 

ACCACTCTTA 

TCTGTGTCTT 

AAAAATCCAA 

TTTATGGGGA 

ACCACATCAC 

GAGATTGAGT 

CAACCTCTGC 

TACAGGCATG 

CCATGTTGAC 

CCAAAATGCT 

AAGTTGAACA 

CATTACACTA 

AAGAGAATCC 

CAAGCTTTGT 

ATTTTATTGA 

CATTTTGGAG 

TTCCTATCCC 

CTGTCTTGAA 

GAAAAAAAGA 

GAAAGGAAAA 

AGGTCCAGAT 

CACCATGATA 

TCAGCAG CAA 

CTCCAGCATA 

TTGTTGAGAG 



CACCTCAGTC 

TGTGTTTTTA 

TAAAGTCATA 

GGATTTTTCC 

ATATTAAATT 

TGGATTTATT 

AAGACTTTGT 

TTTAAACTTT 

GCACATGGAA 

AAGACCGCTC 

GGTCTGCTAC 

ACATGCAATT 

AACAGATGAG 

GAACTTACAG 

CATAAAATGT 

CAACATTAAC 

TAAAAGACTG 

TACCATTTCC 

TGCTCTTGTT 

CTGCATCACA 

CCCTGATCCT 

TAAGCAATAA 

TTTTTTTTGA 

TCACTGCAAC 

TTGGACTACA 

AGTTTCGCCG 

GCCTCCCAAA 

TTAAACCAAC 

TCCTGAGCTT 

TCTGTGAGCT 

CACAGGTTTT 

TCTATCATGC 

TGCTTTTAAA 

CTGCAAGCTT 

CTCATTCTGT 

CTCCTGGGTT 

CAT C AC C ATG 

CAGGCTGGTC 

GGGACTACAG 

TATGTGAAGG 

GGGAATTAGT 

TTGGATGTGC 

GATAAACAAA 

CGCTGAGAAG 

CAATATGACA 

CTTGAAAGAT 

GCCAACCAAA 

CAATGAGACT 

GAAAGGGGTC 

TTCTGTTCAT 

ACGCAGCGTG 

GGTCTATCTA 

TCCATCAAGG 

AAAAAACTTT 



TCCCAAAGTG 

AAGTCACAGA 

GAAGCTTCAA 

TAAAAGAAAC 

AAACATGTCC 

CACAATTGTA 

CAGTCCAAGC 

TAAATGTAAT 

CATTGTTCTG 

TGGAACCTTC 

AAG CAATACC 

TTTCATTTAA 

GAAATGAATG 

CCAGATTTCC 

AAGTTATAGA 

AAGGGGAAAT 

TCGTAGTAAA 

CACAGAAGCT 

TGCAGGCTAT 

CTGACCCTTC 

GTTGCTTTTT 

ATCCTGGGAT 

CTGAGTCTTG 

CTCTGCCTCC 

GGCACACACC 

TGTTAGCCAG 

GTGCTGGGAT 

CTATGTATGA 

GGAAACCTCC 

TTTTTGGCCA 

CTCTTTCTTT 

ACATGGGAAC 

GAAAAAATTT 

TGTAAAAATA 

CACCCAGGCT 

CAAGTGATTC 

CCTGGGTAAT 

TCAAACTCCT 

GCGTGAGCCA 

CAGGACCTAG 

CGAAGTGCTC 

AATACCTTAA 

TGTGCATAAC 

GTTATGTGAC 

TAAATGCCTT 

GGCCATATTT 

TAATTTGACA 

TCATGTGTCA 

TCAGTCAGGA 

TACGCTATGG 

TGAGTCTGAG 

ATGCCTCCAC 

AATTTGATAC 

GAAAGGAAGG 



CTGGGATTAC 

AATTTCAGAT 

TTTAGGAATG 

AAATGTATGC 

ATATTTAGAG 

GTAATTAGTC 

AAGTGTC C AC 

ACATATTAGT 

GTGGTACAGA 

CTCCTTAGCT 

ACTCTTCACC 

TTCTTCCAGC 

ATTAGAGAAT 

TTTTAACAAT 

GCTGTGTTGG 

TATTTGTGTA 

AGAGATTCAG 

AAATGGACGG 

GTTAATAAGC 

GTAAAGATAC 

CCTTAGCCCC 

CTAGGGAGTG 

CTCTGTCTCA 

CGGGTTCAAG 

ACCATGCCCA 

GATGGTCTCG 

TACAGGCATG 

ATCCCTACTA 

AGTAAAATGG 

TTAAAAATTA 

CACTTTAGTG 

CCTTTCAATA 

GTCCTTTCAA 

GTTCTACATA 

GGAGTACAGT 

TCCTGACTCA 

TTTTGTATTT 

GACCTCAAGT 

CTGCACCCCA 

TGACACATAG 

ATTTAAAGTA 

TTCAAAGGCA 

AGATGGGACT 

TGGCTCTGCC 

ACATGTGGGT 

GCTTTACTTG 

AAGTGGGTTT 

TCCAAAGTTC 
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210601 GATTCTGTGG AGTATGGAAG TATACATTTC CAATGACAAA TTAAAACTGA CTGGAACTAT 

210661 TTTTCTTTGA GACATTGCTT ACTTCAATAA TAAAAATAAG ATTTCATTGA GGTTATTATG 

210721 ATTATAAGGT GGGGGAACTG TAGAGTTAAA TGTGAAAAAT TTAAAAATGG AACAGTTTAT 

210781 GTGATGTCTT CAATGAAAAA CTAGGTATTA CCTGGGCACA TTCTTATAGG TTACTCAATC 

210841 CTATTCAGTT CTCTGCCTGT TTTATTGTTT CTGAGCAATT TTATATCCCT GTAAATTCTA 

210 901 TATAACCAAT AGAAATGCAA ACGATTCTTG TCCATAGCTT TGCAAATAAA TTTTGCCAAG 

210961 AGAAAAATCA GTTAAAACTT TTCTCCACTC ACCTCCCAGT TGAATTAGCC AATTTTGCTG 

211021 TTTGTTTGTT TGTTTGTTTT TTGAGATAGA GTCTTCCTCT GTCATTCAGG CTGGAGTGCA 

211081 GTGGCATGAT CTCAGCTCAC TGCAGCCTCC GCCTCCCGGG TTCAAGAGAT TTTCCTGTCT 

211141 CGGCCTCCCA AGTAGCTGGG AGTAAGGGGG CATGCCACCG CGGCTGGCTA ATTTTTGTAT 

2112 01 TTTTAGTAGA GACAGGGTTT CACTAGGCTG GTCTCGAACT CCTGACCTCA GGTGATCCAC 
211261 CCGCCTCGGC CTCCCAAAGT GTTGGGATTA CAGGTGTGAG CCACTGTGCC AGGCTCTGCT 
211321 GTATATTTAA AGTCTATTTC AGCATTGCTT CCTGCTTGTG TTATGCGTGA TTCTTTGAGT 

2113 81 TTTCCTTTGA ACCAGTTATA ACATCTTACT TACTTCCTCC ATTAATCAAT GAGTTAAATA 
211441 AAATCTTTGT TGTATGTTTA TTTTACATTT ATATGAAAAC CATGAATTTA CCCAATTAAA 
2115 01 AAAATTATCC TTTAAATTAT CTTGTACTGT ACATTTCCCA TGTCATCCCT ATAATTCATG 
211561 ATTAATGATT TTATTACATT GGACCTAGCT TATTTACAAT GAGTACATAA ATTTATTGTC 
211621 TCCAGTCTTT CCTCCATTAT CCCGTCTACA TATCCACACT GAG TAG ATTC ACTACTCAGG 
211681 AATCTTGGAC ACCTTCAAGT TGCCAAACAT GCAGTGTTCA CTGGACATGC TGTGTTCCTT 
211741 CAGAATTTGG GCCTGCTTCT CAGCACACTC ACATCTGCTA TCAATGACCC ATGGAAAGTT 
2118 01 TTTGCCCTGA GCAAGCCAGA GTCCCTGTTA GTTTCTTCCA AATGCTACAA GTTCACTTTT 
211861 GCTATTTTTT CCGATGAGAT AAAATTTTCC TTTTTGACTT TCTACAAATC ATAGTCATTT 
211921 TTCAAGGGAT AGTTCAAGTA TTGCTTCCTT TCTGGGACCT TCCCAAATTA TTATTTTCTC 
211981 CTCTCAAAGT CTCTGTTTTA TTTATGTTCA TCCTCAAATC TTGATTCTCA CATGAATCAT 
212 041 ATACCTTGTA TTATTTATAG TTTTTTTGAG TGGGTAAAAT ATTTCATATT TTATATTCTT 
212101 TGGCTCTCTA CTTTATAGCA TGATGCCAGA TATTTAGGGG CCTTATTGCA TTTATTTTTT 
212161 ATTTTATTTT AAAATCTATT TTATTTTTTA TTTATTTATT TTAAAATCTA TTTATTTTTA 
212221 GGTAAATATT CAGGTAATAT AATTTATGTA ATTATTTAGG AATTTTAGGT AGTTATTTTA 
2122 81 AAATAATTCA AATTATTTAT TGAGTTATAT CAGAAGAATG TGATCTTATT CATTTGTAAT 
212341 ATGTGTTTTA GGAACTCAGT TCAGCCAGGG CAGACCATGA TTCCCAAACT TGACTTTTCT 
212401 TTTTAATTAG GCACTGATTT TGGTTAAGAG TTCAGTAAAG TTTTGTGTGT GTGTTTTAAA 
2124 61 AAATTCTTTG ATATAAGAGT CAAGATGTTA CTCAACTTTT ACTAGAAGCA AAATAGAGGA 
212 521 AGTGCTTTCA CAGATGAAAT ATCTCTCAAT GTTTTCTTCC ATTTACTTCT TCCTATTATT 
212581 CATCTATATA ATCATTTTCT TTACCTCTTT TCTTCATTTC TTCTGTTTTT CTCTCCTTCT 
212641 ACTAAGACAA GCAAATTAGG GGTATAATTG GTTATTTGGG AAGGTAGGAA GAATATAGAG 
2127 01 AGAAACAAAA ATCAATATTT TATACTAGGG TCTCACTAAC CTCAAGCAAC TCTGACTGTA 
212 761 AAGTAGATTT TCATAATAGG ACTTCTTGAC AAAGAGTTTT CCTATTTTTC CCCCAGGCCT 
212 821 CTGTGTATCA ATGGAGCCCA GAAACTCAGG GTATCATCTT TAGCTCCATC AACTATGGGA 
212 8 81 TAATACTGAC TCTGATCCCA AGTGGATATT TAGCAGGGAT ATTTGGAGCA AAAAAAATGC 

212 941 TTGGTGCTGG TTTGCTGATC TCTTCCCTTC TCACCCTCTT TACACCACTG GCTGCTGACT 

213 001 TCGGAGTGAT TTTGGTCATC ATGGTTCGGA CAGTCCAGGG CTTGGCCCAG GTATCCAGAT 
213061 ACTTTCTCAT TCTTGGTGGG ATCCAGATTT CTGAATTCTA CAAAATATCA AAGGTCTTAA 
213121 TGATTTTCAT TTCAGGGAAT GGCATGGACA GGTCAGTTTA CTATTTGGGC AAAGTGGGCT 
213181 CCTCCACTTG AACGAAGCAA GCTCACCACC ATTGCAGGAT CAGGTAAGTG TGCACAGATG 
213241 GGTCATAGCT TTGTCATCTG TTCCATCCCA CTGTGTCTTA TCTTCTATGA ATCAAATGGT 

2133 01 TTGGGGAAGA GAGAGAAAAA GTACTGCTGA AAAATTCAAC AATATAAGAC ACTTGCATCA 
213361 CAAATAGGAA AGATGCATCT GTGCAGTAAA GACATTGAAG CTTAGAAGTA GAAAAAACCA 
213421 TTGTGAGCTA GGTTTCAGCT CAGAAAAGCC TTAGTAGTCA GAAAAGCCTT AGTAGTCAGA 

2134 81 AAAGCCTTGT CGGAAAAAGT TTAAACCTTT AAGAATTGCA CACATGGAAA AAGATCAAGT 
213541 AAGCTATATA TACAC CATCT TAGCAATGAT TTTGAAGTGA GAATTAAGGC TACCACAGCT 
2136 01 CCAGGTGGTA AG GAG AG AAA TCAGGCTGGA AGAGTTTGAA GTTTCTGTAT TATTCTAAGC 
213661 TCTTTACTAT TCTATTATGA G CTC ATTAAT TCTCACAACA ACCCTCTCAT ATAAGTACCA 
213 721 TTTTAAATTC TTATTTTACA GAGAAGGGAG TTAAGGAAGG TGGAGATTAA GAAAATTGCC 
213 781 CAAATACAAA TAGCCAGCAG GTGGTAGGTC TGAGATTTAA GCCCATGCAG ATTTTAGCCC 
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213841 CAGAGCAGAC ATTCTCAATC ACTATGCTAG ACTGCCTTTC CATGGTATGT GATCCTACTC 

213901 AGGCCTCTAC AGCTTTATCA TTGCTGTTCT CCCCAGCCTG TCGTGCTGAG AGTATATACT 

213961 CGAAGAGCAG AACTAAAATT CCATCCAGCT TCTCACTCCT AGGTCCACTA CACAGCTGCA 

214021 TCCTGCAGAC TTTTACCTCA AGCAACCCTC CTGCGTTCTT GCTTCCTTCC ATCATAGTTG 

214081 TAACCATCTC CTCTATTTGC AAATACTATC TGCTGATCTC TCTCTTCTAG ACTGGTTTCT 

214141 TTCAACCTTC TTCCCACCAA AACCAAGTTA GCTTGCTAAA ATAAAGATGG CACATTTTTA 

214201 CTCACCCGCT TGAGAATTTT CAATGTGTTC CTTCATGCTT ACAGAGTAAA GCCTGACCTC 

214261 TTTATTGCAT GAATACAAAA GTTCTTAGCC ATCTGGCCCC AACCTTGTTC CACTCAACTC 

214321 CCCTGTGCAA GCATGGCTCC AGTGG CACTG GACATTGGCT GCTCTCCACA TAGATCTGCA 

214381 CTGCACTTCC CTCTGGCTCT GCTCCCGTTA GTTTATATGC CTGGAAAGTT CTTTGCCCCT 

214441 GTTCCTTGTG CCAAAATTCC ATCTATCCTA TTGCATAGCT TATGTAAAAA CTTCCTAAAC 

214501 CTTTTTTTTT TTTTTTTTTT TTTTTTTTTG AGACGGTGTC TCACTCTTTC GCCCAGGCCG 

214561 GACTGCAGTA GCGCTATCTC GGCTCACTGC AAGCTCCGCC TCCCGGGTTC ACGCCATTTT 

214621 CCTGCCTCAG CCTCCCGAGT AGCTGGGACT ACAGGCG CCT GCCACCATGA CCGGCTAATT 

214 681 TTTTGTATTT TTAGTAGAGA CGGGGTTTCA AGCCAGGATG GTCTCAATCT CCTGACCTCG 

214 741 TGATCCGCCC GCCTCGGCCT CCCAAAGTGC TGGGATTACA GGCGTGAGCC ACCGCGCCCG 

214 801 GCCAAAACTT CCTAAATCTT ATAATTATTA TCAATTTATC CTCAGATATA CTTCCACGTA 

214 861 CATTGTAGTT TTATTATATT TATATTTTAC ATCTTTTTTT TCAAATTTCA GTTTGGGACC 

214 921 CATTAGTGAG TCATAAAATC CATTG AG CGG GTTAAAATCA TTATTTTAAA AAATGAATAG 

214981 AATAGAATAG AAATTGTTGG AGTGCATTGG ACATGGTAAA GTTAAATATC GATTCATGAA 

215041 ACCATCGTTT GAGGCATATG TGTGTGGTTG TATGTACAAG TGTTTATGCA TATTGGTGTG 

215101 TGTGTTATGT TACCCTGTAA AATGCATTTC TTACTATAGG TCTCTGTGAA ATATGTGTCT 

215161 TGTTGTTTTT TAATGTAGAC TTCCAAAGCC TACATGGCAT TTCACTAGTG ACAATCAATT 

215221 TTATTCACAT TTTTCTCTCC AATTGGACCA GAAGCTCTTT GAGGGCAGGG GCTGTATCTT 

215281 ACCGATTTTT GTAAGTCTTT CATTTCCTGC CCCTAGCCTC ATATTAGATC ATG CAAGAAT 

215341 GCAACTGTAA TCACAAGAAA ATGCTAATGG GCTGTGATAG CAGAGAGTTA CTGTGACAAA 

215401 CTAAGGGATT TAGATTTGGT CACATTGGTG TTGAGGAGCC ATTGAAGAAT CAGAGAGTGT 

215461 GTTACTATTA TTTGTTAATT TTAATTATAT CATATTACTT TACTGGGGAA AATCTGTGAG 

215521 CTATTTTAGA AATAAATACT CTCATTGCCC AATAATTCTA AGTCTGCCAC CTCACTGTTG 

215581 GGACATTGTT TAGGGAGGCC ACGAAGTCTC AGCCTTTGAT ATTTTCATAA GTGTTTTTCT 

215641 CCCTTTTTCC TTTAGGGTCA GCATTTGGAT CCTTCATCAT CCTCTGTGTG GGGGGACTAA 

215701 TCTCACAGGC CTTGAGCTGG CCTTTTATCT TCTACATCTT TGGTGAGTCA CTTTCTCTTA 

215761 AATCCTAACG CCTCCATTTC CTGAGCATCC ATTTTGGCAC CTACACCACC CACATTCTTC 

215821 CTATATGAAA GAAAATGTCC TTTATCAAAT GGAAGATGAT AAAAAATGTC AACGGTTGGT 

215881 ATCATTTTTA ATCTAGTCAC ACAACCTGAT TAACACCTTC CTGGTGGTTC TGGGAAGCCA 

215941 CACGCACAAG GTAGAGGAGT TGACTATTCA CATGGCACCC ACCGACTTGT GATGCAGTCT 

216001 TGTCCTTCCA TATCAAGCAC CTTCTGCAGA ATCTCTACCA CCACATCTGA AGTGCCTGCT 

216061 ATATGCAGTT AAGATGTCAA AGATAGTGAA GTACATTTTC AATGTGTCTT "ATATTTCAT 

216121 TATAATTATT ATTTCTGTCC AAGATGCCTT TCACCTGTTC TCTACCAAGT TAATCTTGCA 

216181 AAGTTCAATT CAAATGTTCC CTTCCCCATG GGCCCTTCCA GGGCTTACCC TATCAGATTC 

216241 TGGCATTCTC TCCTTTATGA TATTTCCTCT CTAGGTTATG TTGGTGTGTA ATTATTTATT 

216301 TCTCCTTTTC TTTCCACTAG ACTGTGAAAT GCTTGAGGCA AGGAATCCAT TCTATGTTTT 

216361 CATCACTTGG GTGTCATCAT GGTGCCTGAT TTTTAGCTTT AAAATAAAAG AATCAGTGAA 

216421 TCCAGTAATT AGAGGGGATT TAAAGAAAAC TAGTCCTCAG AATCTTTTAA CATAGAATGT 

216481 TCTTCAAATA AGGAATTCCA ATAATAAGAC AATTTTCTAC ACTTGATTTT GTTTTTATAG 

216541 CCAAATGGTG TCATTAAATA TAGTCCTGGC CTGAATGGCT TTCTCATTAA TGATGCTAAT 

216601 TATTTTGGTT TGTACATGTT AACCAGGTAT TGTACAAAAA TATTTCTTTT GGGAATCCAT 

216661 AATGGATGTA TGGCTTGAAT ACAAATAATA CTGTCTCTTG TAAGTG CATT GGAAATTTTT 

216721 CCCTGCCACA TGATTTCATG GAAGGTTGTT TCGTGTATGT ATGACTGCAA ACCTGACTAT 

216781 TCAGATCTTC CG CAACAAG A CAACTTATGT GTGCATTAAG AAGTTG CTGC CTAAAATACA 

216841 TAACACTGTA ATCATTGGAG ACTTTAAAGT AATTAATCAG CTATGCAATG CCACGCTCCT 

216 901 GTTATCTCCA GAGGGCTCTG ACATTGACAA ATGGTGGCTT TCTATTTGAG ACGTAATATC 

216 961 TAAAAAG CTT TAACAGGTTT GTAGAAGGAT TGAAAGAAAG AATGGGAACA TTTAGGTCCT 

217021 TATGGTAGAA TAAGCATTAA TTGATTAGTG TGTAGAAGGG AGAGGCATGC CACTTCAGAG 
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22 0321 AGGG CTTTCT GAGGAGGGTC ACACAGAAGA CCAAAGAGAA CTCATGTTGA ATTGAGATGG 

220381 GTTGTAGTGA TAGTTGTCAA C AG CCAATAC AGAAACAAAA AAAAACAAAA CAAACAGCAA 

220441 CAACAACAAC AAAAAAAAAC AGAGAAGACA CAAACACAAT GCCACAATGC CATTTTAGGC 

22 0501 ATAATTTTAA ATGAGTAATA TTATATGTTG AAATCCAAAT TTTCAGAAAA ACATTAGTGT 

220561 ATTTTATTTT TGTTTAAAGA AATAAC CATC TCAACTCAGA ACCCCATGTG CATTTTGGCC 

22 0621 ATTTTGTTTC CAATAGTTTC ATAAACTTTC TTAAGTAACT ACTGCACATT GTTCCTTATA 

220681 TTCCTTGTGA TCAACATTGC AATACACAAC TGGGAGGGCT ACTAGAACTG GTGTAGAAGG 

22 0741 AACTTGTGAG ATTGATCATT TTCTCTGTTT TTTACATCTA GGATTTTGAG TCTGGTTGGA 

2 2 0801 GGAATGTCTT TTTCCTGTCT GCTGCAGTCA ACATGTTTGG CCTGGTCTTT TACCTCACGT 

22 0861 TTGGACAAGC AGAACTTCAA GACTGGGCCA AAGAGAGGAC CCTTACCCGC CTCTGAGGAC 

22 0 921 ATAAAGTTAC AAACTTAAAT GTGGTACTGA GCATGAACTT TTTAAACATT TTTTACTTCT 

22 0 981 ■ CTCCATATTC CTGACCATAG ACTCAGCAGT TCTTAACTCT GGCTGTGTGT TAGTCTTCCC 

221041 TGGGGAGCCT TTATAAGACA CTGATACTTG GGACCCACTC CAGAGATTCT GAATGAATTG 

221101 GTCTGGGGTG GAACCCAGAT ACTACTAATT TTTAGATACT CCTTAGAGGT TTCTAGCATG 

221161 CGCCCGGGGT TGACAACAGC TGGACAAACT TGAAAAGTCA ATTCATGTGG CCTTTGAATT 

221221 TTCCTCATTG GAAAGTACTA AATAAATAAA AATTCATGTG AAAATGATCA CTGATAAATA 

2 212 81 TCTTCATGGT GGGGCAGGTT ATTGGATGCA GAGAAGATCT GCTCGGAATT GTAG CCATAT 

221341 GTTACAGATC TCAGCACCGA TCGGAACTGT AAAGCTATAA TCCCCAGAAT TAAAGTTTTT 

221401 ATT ATTTTTT ATACATTGTA AAACATAGAC GTTTATTTAT GTGATTAAAT TCTATTAAAA 

2214 61 TTTACATGCT AAAATAAAAT AGACCATTTT CAAATTATTT AGATCCAGAT ATTTCCATCA 

2 21521 GATTAAACAG ATATTTATTT ATCCTAGCCC AATTGCAAGA GATTAATGAT GAGAAAATGA 

221581 CCAATACAAG ATTAAATAAA TGAGGTTAAC TTAGAAATCA AGGACAGAGA AGATAGAACT 

221641 GGAAGGCTTG TATTGTGAGA AGAATGAATG TGAAGGAAGG CAATGTAGAC ACTTCCAGAA 

221701 GGGATAGCAA TATAGTTTAG ACCATATAAT GAAAATTGGA GAGAGATGAC AGAGACACTT 

221761 TCAAGTGAAA TGACAATTTA TATGGGGGAG AAAAATATTG AAGACATAAC AAGATGAGAA 

221821 AAGGCATAGA AATGTATCAC ATACAAGGCA TAGAAGTGTA TCACATACAA GAGAAGTTCC 

221881 TTTTGAGCGT AGAAAAAGAT AATTTAACCT T CTTC AT ATT TTTCTTACTT TCCCAAGATA 

2 21941 CTCAGATAGG CAGCGTCAAC TCTAACAGGA ATTAATTTGG CTCCTAACAC TTAAGACATA 

222 001 TCCTTTAGTT TGTCTCCTCA CACAGAACTG ATTCTGGTTT TGCCACAACA TGTCTAGAGA 
222061 AGAAGTTCCC ACCATATTTT AAATCCTATT AAAAAACTGC TTGGACAAGA ACCTTGGGTT 
222121 AATTCAGCAG ATGAAGAGAA TCTCCTAATG CAAATCAATG GGTATTTTTG AGCAAGTTTT 
22 2181 T C AG AAAAAC AGAGTGTCAG GCCCTGAGGG TGGTACTAAG ATGAGAACAT TGATTTTGCC 
2 22241 TTCATGATAT TGACAACACA AAGAGGAAAG GGGGTTTGCA GAAAACTAAA AGAAGAAGTA 
22 2 3 01 GAAGAAAAAA GAAAGACATA GTATAATAGG TAGTCAAATT ATGTACAGAA AAAAGAGAAA 
22 2361 AAAAAAACAA AAAAGGGTGG GGGACAGACA ACCCAACTAA AAAATGGGCC AATGACTTGA 
22 24 21 ACAGGGACTT CATAAAAGAG AAAATGTAAG TGGCTCCTTA ACATATAAAA AGATGTTCAA 
22 2481 CTTCATTAGT CATTACAGAA ATGAAAATCA AAACTACAAT G AAATAC C AC TATAAAATTA 
2 2 2541 ACTAATGGAT AAAATGAAAG GAGATGGAAA ACAAAATGTT GCCAGACATG TGGAGCAACT 
2226 01 GGAACTTTCA TACGTTACGA ATGTGAACTT TGGAAAGCTG CTCGGCAATA TCTCCTAAAG 
222661 CTAAATGTAC AATTCCAGTG ACTCAAACAT TTTACTTAGA AATGCACATA TACATCCATA 
222721 AAACATGTAC AACAATGTTC ATAGGAGCAC TATCTGTAAT AGCCTGAACA GGAAGTTGTC 
222781 TGTTAAAAAA AGAATGAGTA AATAAACCAC GGTCTATTTG TATAGCAATG AGAATTAACA 
22 2841 GACCCCAATA TATAATAGAT GAATGGGTCT CATAAGCACA ATATTGATTA AAGGAAGACA 
2 22 901 AAACGCACAT TCTTTTAAAG GTTTATAAAA TACTTTTTAA AAACAGCTAC AACCAATCTG 
222961 TCCTGTTAAA AATCAGTGAG CGATTTCCCT TGTGCAGGGA TGGGGGTTGT GGCTGGATGG 
22 3 021 ATGGTACTTA AGAAGTGCTC CTGGGGTACT AGAAATATTT TATTTCTTGA CTTGGATGTG 

223 081 TGTTTACTTT GTGAATATTG TACATTTATG ATTTGTGCAC GTTTATGAAT GTAGAAAATA 
22 3141 AAACAGAAAG CAAATTCAAA GTATCATCCT TTTGAGAGCT TCTGCTCTGA CTTCGTTTTG 
223201 ACCAATGGAG CAGTTGGGAA GGGGTCTTGG TCCTTCGGTC CTTTGCTTTT TTTTTTTTTT 
223261 TTTTTTTTTT TAGACAGAGT CTTACTCTGT CGCCCGGGCT GGAGTGCAGT GGCTCGATCT 
223321 TAGCTCACTG AAAGCTTTGC CTCC CGGGTT CATGCCATTC TCCTGCCTCA GCCTCCCCAG 
2233 81 TAGCTGGGAC TACAGGCACC TGCCACCATG CCCGGCTAAT TTTTTGTATT TTTTAGTAGA 
223441 GACGGGGTTT CACCATGTTA GCCAGGATGG TCTCGATCTC CTGACCTCGT GATCCGCCCA 
223 501 CCTGAGCCTC CCAAAGTGCT GGGATTACAG GTGTGAGCCA CCGCGCCCGG CCCCTGGTCC 
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223561 TCTGCTTTCA TGTTCTTCTT GGTCCTGTTC CTCCTCCTCT TTTGTTGGAA CTTCCAGTAT 

223621 CAGAGCAGGA AGGAAGGCAA TGGGTCAATC GATGCTGTCA GCTTTTGGAT CAAACTG CAA 

223 681 GTTCTCAAAC AGCAAAATTA ATGAGCTCAG GCTTTGAAGA AACCATGACC CTG AAAG CAT 

22 3 741 CAGTTGCTTC CAATTGCATC AGTTGCCACG GGTGATAAGA ACAATGATGA CTCAGAATGC 

223801 CTAGGTTTTC C C AG CAGCTT CTCTGAGGTT TTCCCAGCAG CTTCTCTGAT TGATTCCTGA 

223861 CAGATGACTT CGGTGTGTCA GACTTTCAGG GTATCTTTCC TTATGTGATG GTTTGAGGAA 

22 3 921 GAGTTACCAT TCACATTCCT AATGGCTTCA GAATAGATGC AATTGTGAAC TGATAGGAAA 

223 981 CATTTCTAAT TCATCTCCCC TCCCCATCCC TAAAGGATTG TTTCTAACAA TAGTCATGAA 

224 041 AATTAATTCA CTTTTCTCAA ATAGTTTATT GTCATCTACC TAATGATGAG ATGACTTACT 
224101 TTTTCTCCTT GACTGTTAAA TATTATGAAT TATATTAATG TATTTCTTAA TGTTGAGCTT 
224161 TCCCTTGAAT ATTCTTTTGA TGTACGACAG AATTTGATTC ACTAATAGTT TATTTAGGAC 
224221 TTTGGCTGAT GTACTGATAT ATGAGATTGG CTCTGTATGC ATACATGTGT TTTGTGTATC 
224 281 TTTTTTGTGT CTGGATATGG AG CTTATGCT GATTTCAAAA ACAAGAAAGG AGAACTTTCC 
2 24 341 TTTTTCCCCA TTACTCTGAA AAAGATTGAC TAGAATGGAA TTTTTATAAT TGCTGTTGTT 
2244 01 ATT TG AAAG C TTGAAAGCAT TGGTTTGTAA AAATCATGCA GGCTGAAAGC CATTTTGAGG 
2244 61 AGACTTTGAT AACTTTCTCA ATTTCCTTCA GTTACTGGTC TTTTAAGGGG TTTTATATTT 
224 521 TTCTTTGATC AATTTTG AC C ATTTATGTTA TCTTGGAGGA TCATCTATTT TACACACTAT 
224 581 TTAAAGTATA TTTGCAAAAA TTCAACTGTT TTATCAGGCT ATCTTTTTAA TAATATATTC 
224641 ATTTTATCTA TATCTGAGGT TTTAGCTTCT TTGTACTTCT GACCCAATTG CATGTGTGCT 
224 701 TTCTTTCTCC TTCATTAGAC TACTTAGTCA TTTACTAATT TTAAGAATAG CTTGTCTTTT 
224 761 ATTTATTTAC TTATTTATTT TTGAGACGGA GTCTCACTCT GTCACCCAGG CTGGAGTGCA 
224 821 GTGGCGCGAT CTCGGCTCAC TGCAACCTCC GCCTCCCGGG TTCAAGTGAT TCTCCTGCCT 
2 24 881 CAGACTCCCG AGTAGCTGGG ATTACAGTCA TGCACCACCA TGTCTGGCTA ATTTCTGTAT 
224 941 TTTTAATAGA GATGGGGTTT TGCTATGTTG GCCAAGCTGG TCTCAAACTC CTGACCTTAG 
22 5 001 ATGATCTACC CACCTTGGCC TCCCAAAGTG CTGGGATTAC AGGCATGAGC CACTGCGCCC 
2 2 5061 AGCCCTGCTT GTCTTTTTAT TTTATATTTG ATTAGCTTTA TCTTTTATCA AG CTT ATGTC 
22 5121 CTATTTCCCT TTGCTTTACT TCATATAAAT TTTGTTTTGG ATAGTTTATT TATTTTTCAT 
22 5181 TTAATTATGA AACAGGTTAA AGCTTAGAGG AAAATTGCTC CTCTAAGTCC AATTTTGTGG 
22 5241 GCAGATTACA TTTTGCTGTG TTGTGCTCCC AAATTCATTG TTCTTTTAAT GCTTTATTTC 
22 5301 TCAAGTTAAT AACCTATATA GTAAAAAAGT GGCTGTTGAC TCTCAGCTTT TTTTTTTTTT 
2 2 5361 TTTTTTTTTT GTAGATACAG GGATCTTGCT GTGTTGCTCA GGCTGGTCTG AAACTGCTGG 
22 5421 CTTCAAGGGA TCCTCCTGCC TTGGTCTCAC AAAATGCTGG GATGACAGAC ATGAGACACC 
22 5481 ATGCCTAGCC ATGTCTCTCT CCTTATATAT AATAAGAAAA CAGACACACT GAGGCATCCT 
22 5541 ATCATCTCAC TCTTGGTTTC ACTACTGTTC TCTGGAAGTT TTGCTCTGAC CTTTTGCAGT 
2 2 5601 TAATGTATTA ATTTTG C ATT GAGTAGTTTC CATAGAAGAA TTATAGCATT TGCATTCTGT 
22 5661 TGGGTATTAT ACTTTTCACT GTTATTTGAA CATAATTTGA GGGCTGAAAC CAAGATGAGG 
22 5721 CAAGTGAGGT GCCCAGGAAG CAATATTTAA GGAGGCATCC TTTCTTAGGC TCATGCAAGA 
22 5 781 ACAGAATTGG CACATGAGAG TGAGTGCCTC CTTAATTTTG AGTGCTGGAC ACTTCTTGCT 
22 5841 CACTTAGCAT ACCCCTGGAC AATGAAGTGT TTTTTGTTTT GTTTTTTCAT GTCCATCCTT 
22 5901 TATCCTTCTT CATCTCAAAA CATTTCAATG GAGTATTTTT TTGGAGCAGT ACTTGGATGA 
22 5 961 GCCTCTGAGT CCCACAGTAG CTGAGAATTT ATTTCATAGT ACTCTTTATG ATCACTGTGG 
22 6 021 AGCCTTAAAA CATTGTAATA TTAACTTAGC TGGGAACAGA AATTTTGTTC CACAATTTGT 
226081 CTTATTCAGA ACAGTATTGA CTTCCTGCTA GTCTCTTCTG ATGTCCAATA TGAGGAAGTC 
226141 TAGTTAGCCA GCTACTTTTT GTAGGAG AG C TATGTTTAGG CTAGGTGCTA TAGGATTCTC 
226201 TTTATCCTGG AATTCCTTCA CCAAGATGTG CCAAGGTGTT AATCATTTTC TCTTGCTTTT 
226261 TGGCTGGTGG TCTTAGAGTT TCCTTCGATT TTGTTTTATT TAGTGATTGT CCTCAATTTG 
226 321 TTTTCTTTAC TAAGAATCTC TCTTCTATTT ATCTGTATGG TAAAACCTTG TTGCCCATCT 
226381 TTCTGGTTTC TGCTGACTTT CATTTTTGGA CCTTTTACTT TGCTTTCTCC ATGGACTTTT 
2 2 6441 TGGTAGTGGA GGC AGG C AAA CACTTTCCAA AGTCTTTCTC AATTTCCATC AATTTCAACT 
226501 TATTTCCTAA AATTG CCTC A GAATGTGCCT ATGTCCACAA TATCCCTCCT TCCACTTTAG 
22 6561 AAAGGAAAGG CATCCACACT TTATTTAGGT GCAATGCCTG AAGTGTAAAC ACTTTCTGGT 
226621 TGTCAACAAA GGAGTACTTC CAAATATTGG TTTGGGGATA ACCTGCTAAT GATTAACACA 
2 26681 TTCACCTTGG CTCTTGGTTT GCCTGCTCCC TCTTCTTTTA TCTGCTGTGT GTATTTTTTT 
226741 TAATCACTGA G AATATG C AC AGTATTGTAT GTTTTATTAT AAGAGAGGAC TGGCCAGAGT 
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22 6801 GGGAATGTTC TGAATTCAGA ATAACTGAAG CAGTACAGGA TAGGAACTCA TTCTTTCAAA 

22 6861 TGAAGCTGGC ATATTTTCCC AGAGCAC CAA ATTTCAATAT ATATTTAAAA AACTTGATAT 

226921 GAATGATACA ATAAAGTGGT TAGAACTTTT ATTAAAATAA ACTTATGTCA TGAAATACTT 

22 6 981 ATTCTAATTA TAGTCACTCT TCATCTTATT TCATCTTATA ACATGTTTAA TGTTTTCTTT 

22 7041 TATTTACAAA ACAATTTATT TTTTGATGAA AAGTTTTAGA AATCAAGTTA AAAATATTCA 

22 7101 AAGGAATGCC TAAAGTTTTC AAAATTCTTT TACATGTTGT ACAATCAAAA GAGTCTGAAG 

22 7161 ACCATTTAGC TATCCAAATT GTTTATTTTT AAGCAGTATC CCTTCTAATA TTTACTATTT 

22 7221 ATAATCCTTA AAAATTTGCC TTAGCACAGG AGAATTGCTT GAACCCAGGA GACGGAGGTT 

22 7281 GCAGTGAGCC AACACAGTGC CACTGCCCTC CAGCCTCGGC GACAGAGTGA GACTCTGTCT 

22 7341 CAAAAAAAAA AAAAAAAAAA AAAAAAAAAG GCCAAAAACA AATAAACAAA CAAAAAAATC 

227401 CGCCTTAACA TTATTTGTTC ATTAAAAACT TTCTTTAATA CTACTAGTTT CCCTTTCCTC 

22 7461 TCAGCCCATT GTCATATTTT GATTTTTATC ACTTGCTTTG TAGGACATAT GAGGTTTTTG 

22 7521 TTTTTTTTTT TTTTTGGAGA TGCAGTCTCC CTCTGTTGCC CGTGCTGGAG TGCAATGGCG 

22 7 581 C AATCTTGG C TCACTGCAAC CTCTGCCTCC TGGGTTCAAG CAATTCTCCT GCCTCAGCCT 

22 7641 TCCAAGTAGC TGGGATTACA GGCACCCACT ACCACGCCTG GCTAATTTTT GTATTTCTGG 

22 7 701 TAGAGACGGG GTTTCACCAT GTTGGCCAGG CTGGTCTCGA ACTCCTGACC TCAAGTGATC 

2 2 7761 CACAATCCTT GGCCTCCCAA AGTGCTATGA TTACAAGCAT GAGCCACCTG CCCAGCCAGA 

22 7821 ATATATGTTC ATTTTGAGTC CTTTAACAAA GTCATAAGAA TTTTAGGAAT TCAGTTACTT 

22 7881 TCTTGAGAAA ATCTCTGAAA AGATGCCAAT AATTTGT AG C CAATTATATT GATTTCTCTT 

22 7 941 TTTCATATTG AGAATTGTTT TTTAAAAAGT TTGTATGTGT GAAGATTTTT GCACTGTAGT 

228001 TAAAGAAACC AC CTGTGTGT TGGTTAAGCC ATAAGTACAT GTATTCAAAT AAATTGAGGT 

22 8061 GGGGTTACTC TGAGAATCAA AGGAAAACCT GAAGAAACAG GCAGCCTCAA AAGGTCTTAG 

22 8121 CTGTAGCAAC TTGCTCCATT GTTGAAATAA ATAGGCTTGA ACTTGTATTT TCCCTCTACT 

22 8181 CAACATTTAA GGTCTCAGAA GATAATATAA TTGGTGAAAT TTAAGTAAAG TGCTCACTCT 

22 8241 TTTGCTTTAA CAAACCCTAG AGAGCTGGTA GGCAGAGCCT CAACAGACCG TTTTAGCTTC 

228301 CAAAGGGAGT TCAGGACACC ATGATTCACG ACCACAATAC ATCACACATA ATTGAGAAAA 

228361 GATAGTTCCA CCAAATAAAG TTGAAATGCT GACAAGAAGG GGTAAGAAAT CTTGGAAATA 

2 2 8421 AGTTTATATA AAATTTATTT TTTCCTTTTT TATTGTTATG GAATAGGACC AGTTCTACTT 

228481 AAGCCACCCA TTTGCCAAAA TAAAGTGAGA ATCGTTTCTT TTGGGGACTC CTCTTTGTAG 

228541 CTCCAAGTGC CACTAACAAT TCTTAGGACC TGAGCTATAA G CCAGGTG AT TTCAGTTAAT 

228601 ATGATCAATT ATTTCATTTA AATGGCTCTA ATGTGCAGAG GGAACGGAGC CCATCAGCAT 

228661 TCCCTGCAGG GAACTGCAGT GGCTTTTATC AACTTGAACA GCTAGCTTTC AACTGTTTTG 

22 8721 AAATCACTTT CAGGGTGGTC ATGTAGTTGC TTTTTTGAAA TCAGAAGATG ATTCTGCCTC 

22 8781 TTTTAATATG TGACTCCTCA GATTCAGAAA GTGCTCGCTA GTCTTAAGAG TGAATTACCC 

22 8841 TCAGTGGTCC AGCGCTTATG AACCCACATC TAACCCTATC CCCTGGGGGA ACTATCAGAG 

228901 AAATTGGTGC CATGGACATA AGAGGAAGGC ACAGTGAAGC AGAGAGCCCC GCATGATGAA 

22 8 961 AATCAGTGGA CAGCATCATT ATTTACAACT TTGTAATCAC CCAGGAGCAT GAAAATCCAG 

22 9021 GCCAATCTGG CACCATGAGC TCTAATTTTT GTTGGAGTTC TTGGAACCGA TTCTGATGAA 

22 9081 TGACTGTTTA GCCATTTTAG AGTGTGGCAT ACGTGGCTGC TGG CAT AC AG AGGTTGGATG 

22 9141 TAAACGGGCC TTTGCCCTCT CTTATGAACA TAGACAGGAA CTAAACTGTG TCACATAGGT 

22 9201 TCCAAATGGT GGCCTGAATA CTATTTACAA CTAAGGTACA ATGAAATTGA GTAAGTCTTT 

22 9261 TCCTCTTTTG CAGATACCAT CATTATTCAT ATATTTCTTC AAAGTTAACT ATTTGTATTT 

22 9321 GGTAATTTTT AATAGAAATG TAATAATTGC TTCTCAAGTT TAGTCTTTAG TCTTAAGGTT 

22 9381 GATGCTCTCC ATGTCCTTCC AAAAAAAGGT ATGTTGCTTT TATTATATCC TCGCCTTCAG 

22 9441 ATGGGATTAT TCCATTTTGT TCTTTGTTAA TATATACTTT GAGCCACTTT TTTTGTGGCT 

2 2 9501 CTGGGTGAGA TGCTATAGGT ACAATGACAA GTGATACGTG TGTTGTCCCT GTCACAAAAG 

22 9561 TGGATAGCCT AAGTGGTGAC TTTTACCTCC ACTCCAAATA TATGTATCAC ACACCAGCCG 

2 2 9621 TATGCCAGGC ACCACTCTAG GTGCTAGGGA TACAGCAGTA AACAGACAAA TGCAACCCCT 

2 2 9681 GCCCATGTGA . AAGAGAATAA GACAATAAAT AAGTAAAGTG CATGTTATAT GGAGGTGGCA 

22 9741 AATGCTAAAA AGAAAAATTA AGCAGGCAAG AGGACTCATT GAAAAGATGA CATTTGGGTA 

22 9801 AAAGCCCATG TATATATGTT CTATTGGTTT TATTTCTCTG GAGAGCCCTG ACTAATACAC 

22 9861 AATGACTTTG AGAAGTTACT GGCTTTTGAT TTATCACACT ATTCGGAGTG CTGAGAGCCT 

22 9921 TCTTAGTGTG TATTCAGTGT TTTAAGAGAG CTTGTGGATG AATAATAAAT AGGACAAAAT 

22 9981 TTATCCAAAC TTAAGCCTTG CTTTAGGTAA AAGGGCTCCT CTTACAAGGT AGAAGGTTAT 
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2 33281 ACCTTAACAG TCCTGAAGAT CATTTGCTTT TTTTTCATAA TTACACCGGA GTTATAGATT 

2 33341 TTTTGAAATA ATACCACAAG GGCAAAGGGC CCTTCTTGTC ACATCATTTT AGGGAGAACA 

2334 01 TGATATCCAC ATGACATCAC TGATATTAAC CTTCATCATG TGGTTTAGGT AATGTTTCAG 

2 33461 GTTTCTCTAC TGCAAAGTGA TTTTTTTCCC TTAATTT AG C CCACCTGAAC TTATCAATTT 

233 521 TGTTTTCTTC CATGACTAAT ACTTTTGTTA TTATAGCTAA AACTTCATTG GGGCCAAATC 

23 3 581 TTAGATCATG TAAATTTTCT TCTATATTTT ATTCTAAAAG CTTGTAATGT TTGATACATT 

233641 CTAAAAGATG TAATGTTTGA TACATTACAT CTAGTCCTTT GATTTATTTT TAGTTACTTT 

233 7 01 TGTATAAGGT GTGAGAGATG TCTCCAGTTT CACTTTATTA ACACATTGTG GTGTTCCAGT 
233761 ACTATTTGTT GCTAAGACTA TCTTTTTTCC ATTGATTACC TTTGCCTTAG TTGGCAATAT 
233821 TTTTGTTGGT TTATTTCTAG ACTGTTTATC TCATTCCACT GATTTGTGTC TATCTTTTTG 
2338 81 ACAAAACTGT TGATTACAGT AAGCTTTGAA ATAGTTCATT TTTTGTGTCA AC TTG ACTG A 
23 3 941 GTCAGGGGAT AACCAGCTAT CTGGTTAAAC ATTATTTCTG GCTGTGTTTG TGAGCGTGTT 
2 34001 TCTGGATGAG ATTAGCCTTT GAATAGGTGA TCCTAGTAAA GTAAACTGTC TTTCCCAGTG 
2 34 061 TGGATGGCAT TATGCCACCT GATATTCAGG GTCTGAATAG AAGAAAAGGC AGAGGAAGGG 
234121 GGAATTTGGG CCTTTTTTTC TGCCTCACTG CTTGAGCTGG GACATCTCAT CTGGTCTCCT 
2 34181 GCTCTTGAAC TGGGATTTAC ATCATCAGTT CCTCTGGTTC TCAGGCCTTC AGATTCAGAC 
2 34241 TGAATCATAC CACCAGCTTT CCTGGGTCTC CAGCTTGCAG ATTACAGATC ATGGGACTCC 

234 3 01 TCATCTTCCA TAAATGCATG ^AGCCAATTCA GTCTATGTCC TTGAAAACTG CCCCACTGCA 
234361 GATTAAGGCT TTTTTCCACT AGGTGAAATA AAGAAGCTTG TTAGACAGAT TTCCCTTCAT 
2 34 421 CCAGTGCCCT CTCCTCTTTA AGTTACAACA CATTGGCTAC ACCTAAGTGC AGGGGTGGGG 
2 34481 ATGAGGGTAT AGTCCTCTTG TTTGCTGAGA AGAGAACTGT ATTGGGAAAG CTCTAGAAGT 
234 541 GTTTGATACA TACATAAACA AGGCATGGTT TTTGCACTTA ATTTCACATT ACATTTTTCC 
2 34 6 01 CAGAAAAAAA GGAATGTATA GGCATCACGT AACTGTACTA GCTGGAGTCA TTCTTCCTGA 
2 34661 TTATCAAAGG TAAACAGTTA TTAATCCTAT ACCAAGATGT CAAGGAGAAG TACTTTTGGA 
234 721 ACACAAGGAA TTCTCTGGGA GTCCTTACTA CTCTCAAGCC CAGTGAAAAA GTTAATGAAA 
234781 AACTATAGTA CCTTCCTATA AGCTGGATGA CTAATTACCA GGCTCATTTA GGAATTTGCC 
2 34 841 TTACCAAGTA AAACATAAGG GCAGCTGAGG TGCTGACTGA AGACAAATGG AG C AT AG AAT 
2 34 901 AAGAGTAGTA AAGAATGC C A AAAATGCTGT CATGTATCCA TTGACAAAAG GAGCTATAAA 
234 961 GCCTTTAGGT ATTTTCACAC TTGCTCTGTT ACGTAAATGT ATGTGTGTGT GTGTGTGTGT 
2 3 5021 GTGTGTGTGT GTG 
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1 CACACACACA CACACACACA CACACACACA CACAAATGAG GTATATAAAG GGTCTCCTAA 

61 AATGTCATCT GATATTTGTT ATTTCATATT CTCAGATTTT TAATCCATTT AGGTAGGTCT 

121 ATTTTAGATA GCCTTGTCTG AAACAGAGCT GGGACCTGAT GAGTGAAAAT GAGCTCACCA 

181 GAAGAAAAAT CAAACAGGCA TTTCAGAGAT TGAGGCCAAG AAGTTAAATG TCTTAAATGG 

241 GCAGAGCTTA GCTGCTTGAT GTGAAAAGAG ACCAGCGTGG CTGGAACAGC AAAGGAGAAC 

3 01 AGCAGAAGAG GTGAACAGAG GCCAGAGATG GTCACTGAGT GGGCCCTTAA GTCATGGTAA 

361 GGAGTATGGA GAATGAATTA TTGCATGTAT TGAATATGTA GGTGACGTGA C TC AC AG ATA 

421 CTTTGGATTT GTAGAGATGA AGGAAATGTA GCAAGTGACA CTCTTAGAAT GTTGATTTGA 

481 GTAAATGGTA GTGTCAGTTA TTGAACTGGG GAGAACTGGA AGGGATAACA GGCTTAAGGA 

541 GCACGTTTAT TCCTGTGTCT TGGAAGTGTT TAGGGTGAAA GACCTATTAG AGTTCTAAAT 

601 GGAGATGTCA AGTGAAAATG TGGCTACACA CATTTGCATT TCAGAAAAAA GGTCAGGCTG 
GAGATGTAAA ATTGGAAGTT TACTGCATAT AGATAGTCTT TGGAACCGTA GTATTGATGA 
AGCCATTAAT GAGACAGAAC AAAGACTAGG GACCAGAGCC AAGCTCCAAG TTTCTAAAAT 
TTAGAGGATA GTATAGTCTG GTCATTTTGA GGTGAATACT TAATAACAGA ACAATTTGCT 
GAAGTGTAAA TTTAGAGCCC TACACTTTTA GCTCTGACTA TTAACGAATA CAGGAAAGAA 
TGGATATGGT TATCTGCCTG GTGTCTGTGA AATAATTTAA GCCAGGAAGA GATCCTCACC 

961 AGAAACTGAC TATGCTGGCA ACTTGGATCT TAGATTTCCA GCCTGCAGAA TTGTTAGAAA 

1021 ATAAATGTCT ATCGTTTAAG CCACCAGTCT GTAGTATTTT GTTATGGCAG TCCAAGCTGA 

1081 CTAAGTTTTG GTACCCAGGC GTGGGATGCT GCAACAACAA ATACCTAAAC ATGGGGAAGT 

1141 GGCTTTGGAA ATTGGTGATG GGTAAAGGCT GGAAGAGTTT GAGGTTCATA CTAGAAAAAG 

1201 CCAATTGTGA AGGGACTATT GAAAGAAATA TGGACATTAA AGGCAATTCT GGCAAAGGCT 

12 61 CAGAAAGGAA GAGAGCTGGA CAGAAAGCTT CCATTTTCAT AGAAACTTAG ATTTATAACG 

1321 ATCATGGATA GAATATTAAA TATGCTGGTT AAAATATGGA CTTTAGGCCA GGCGTGGTGG 

1381 CTCACGCCTG TAATCTCAGC ACTTTGGGAG GCTGAGGGCA CAGATCACGA GGTCGGGAGT 

1441 TTG AG AC C AG CCTGGCCAAT ATGGCGAAAC CCTGTCTCTA C T AAAAAT AC AAAAATTAGC 

1501 TGGGCATGGT GATGTGCTTC TGTGGTCCCA GCTACTCGGG AGG CTGAGGC TGAAGAATCG 

1561 CTTAAACCCG GGGGGTGGAG GTTGCAGTGA CCCAAGATCA CACCACTGCA CTCCAGCCTG 

16 21 GGATACAGAG CAGGACTCCA CTCCCCCCGC CACACACACA CAAAAAATAT ATATATATGG 

1681 ACATTAAAGT CAACTCTTGT GAGGTCTCAG ATGAAAATGA GGGACAGGTT ATTGGAAACT 

1741 GTAGAAATCA CTGTTCTTGT TACAATGTGT CAAGAACTTG GCTGAATTAC GCTGTAGTGT 

1801 TTACTGGAAA GAACTTATAA GCAGTAAAAC TGGATATTTA C C AG AAG AG A TGTCTAAGCA 

1861 AAGTATTGAA GGTGTGATTT AGGTCCTCCT TACTGCTTAA AGTGAAATGT GAGAGGAAAG 

1921 AGCCGAAATA AAGAAGGAAT TTTTAAGCAA AACACAATCA GAACTTGGAG ATTTGGGATA 

1981 GATTTCTCAA TCTATATTGT AAAAATTGAG AAAGTTTTTC TTGAAGAGGT ATGGTTGAAC 

2 041 AATGTTTTCT TTTTCTTTTT TTTTCTTGGT TTTATTTTTA TTTTTATGTT TTTTGAGACA 

2101 GGGTCTGGCT ATGTCATCCA GGCTGGAGTG CAGTGGCACA ATCTCAGTTC AGTGCAACCT 

2161 TTGCCTTCAG GCTCAAGCAA TCCTCCCACC TCAGCCTCCT AAGTAGCTGG GACTACATGT 

2221 ATGCACCACC ACACCCTGGC TAATTTTTTG TTGTTGTTTA TAGAGATGGG GTTTTGACAT 

2281 GTTGCCTAGG CTGGTCTCTA ACTCCTGAGC TCAAGTGATC TGCCCTCCTC AGTCTCCCAA 

2 341 AGTGTTGGGA TTACAGGCGT GAAACACTGA GCCTAGCCTG AACAACCATT TGATAAAGAG 

2 4 01 ATAATGGGTG TGACCCAAGG ATTTAATCAG CCATCTCAGC AGAAGCCAGG AAGAGAGATG 

2461 GGATTATTCC AG C AG AG AC A CTGCCAATTT AAACTAACGT AGG CAGAGAA AACAGAAAGG 

2 521 AACAAAGGAA GGTTGTCGAC TTTTTGAATT CTATAGAACA GGATCATAGA GCTACCTGGC 

2 581 TGTCAATGTG TACTATTCTT TAAGAAAAGG AAAGACTGAC CCACCAAAGG CAACTTACAA 

2641 GATCACTAGG GCTGACTCTT TTTTGTTTTT TCTTGAGGCA GTCTCACTGT CACCCAGGCT 

2 701 GTAGGGCAAT GGTGTGATCT CAGCTCACTG CAATCTCCAC CTCCCAGGTT CAAGGGATTC 

2761 TCTTGCCTTA GACTCCCAAG TAGCTGGGAT TACAGGCTCT AAATCTGTAC CCTCCCGAGT 

2 821 AGCGCTCCTG CCACCACTTG CCCAGCTAAT TTTTGTATTT TTAGTAGAGA TGGGGTTTCA 

28 81 CTATGTTGGC CAGGCTAGTT TGGAACTCCT GACCTCCAGT GATCCATTCT CATTGGCCTC 

2 941 CCAAAGTGCT GGGATTACAG GCAGGAGCCG CCAGGGCTGC CACTTTGATG TCAGACTCAG 

3 001 AGAGTACAGA TGGGATAGGG TGGGGGTGGG AACATGTAGT CAAGGCTGAC TCTACCTGTT 
3 061 TCAAAGATGC CCTGCAGAAC TGTGTGGGAG TCTCTCACAG ATGGCTGCCT GGGTGGGACC 
3121 CCACCAAACT GAAAGACCGA GACTTCAGGC AGGGCAGATG GAGTAGGCCA ACTACAGAGC 
3181 CAGAG GTGAC ACTGAGACAC CACTGGGCCT GGAAATCAGG GCATCAAGCC AAAGAGGGTT 
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3241 TTTCTTAAGA CCTAACAGAA TTTGCCTTGC CAGGTTTTGG ACTTGATTAG GACACATTAC 

3 301 ACCTTCCTTC TTTCCTATTT CTCCATTTTC TAATGGGAAT GTCTATTATG CCTGTTTCAC 

3 361 CATTGTACCT TAGAAGCATG TAACATTTCT GGTTTCACAC GTTCAAAGCT GGAAAGGAAT 

3421 TTTGTCTCTG GATGAATCAC ACATTGAGCC TCACCCGTAA CCTGATTTAG ATGATTTTTT 

34 81 AGATGACACT TTGAACTTTA GAATTGATGC TAGAATGAGT TAAGACTTTC AGGGGGCTGT 

3 541 TGGGATGGAA TAATTTTTTT TTTTTTTTTG AGACGGAGTC TAGCTCTGTC GCCCAGGCTG 

36 01 GAGTGCAGTG GCACCATCTT GGCTCACTGC AAGCTCTGCC TCCCGGGTTT ATGCCATTCT 

3661 CATGTCTCAG CCTCCAGAGT AGCTGGGACT ACAGGCGCCC GCCACCACGC CTGGCTAATT 

3 721 TTTTTTTTAT TTTAGTAGAG ATGGGGTTTC ACCGTGTTAG CCAGAACGGT CTCGATCTCT 

3 781 TGACCTTCTG ATCCGCCTGC CTTGGCTTCC C AAAGTG CTG GGATTACAGG TGTGAGCCAC 

3 841 CATGCCCGGC TGGGATGGAA TAAATTTATC TTGTATGGGA GAAGGACATA CATTTTGGCA 

3 901 GGTCAAGGAC AGAATGTTAT GGACTAAACT GTGTCCCCCA AAATTCATTT ATTAAAACCC 

3 961 TAAACC CCAG TGTGACTGCA TTTGGACATA GAGCCTTTAG GGGGTACATA AAACTAAAGA 

4 021 TCACAGGATA GGGCCCTAAT CCCATTGGGG CTGGTGTCCT TACAGAAGAT GAGACACTTA 
4 081 GAGCTCTCTC TCCACGCAGG CACCAAGGAA ACACCATACA AACACACAGT GAGATGGCAG 
4141 CCATCTGTTA G CCAGGAAC A GATTCTCACC ATAAACTATG TTGGCACCTT GATCTTAAAC 
42 01 TTCCAGGCTC CAAAACTGTG AGAAAATGAA TTTCTGTTCC AAGCCTCTTA GATATGGAAA 
42 61 AAAAGATTCT GTTGTTTAAG CCATCCAGTC TCTGGTATTT TGTTATGGCA GCCTGAGTAG 
4321 GCTAAGACAA TGAAGGATGT GGTAAAACTT TACGTCCCAA CCACATACCA AAGAGGCTGG 
4381 AATTTAGCAT GCTTTCTTCT TTCAACTGTA GGCAATGTGC ACAAGTTCTA AATCCTAAGA 
4441 CATGTTGGCT CCTTTACTCT GCCCAAACTA CAACTCAAAC AAACAACTGT AATATAATAA 
4 5 01 CATCCAATGA AGTTCTGACA TTTCTTCAAC ATGAGTACAG TAATTCAATG CCAGAGAATT 
4 561 CATTTTATTT TGAAATCTAC ATGCCATATT CCAATTTCTG TTGAAGATGC AATGGTTATA 
4621 TTTATTCTTT TTAATATAGA TTTATCAGAC TGGGCGCGGT GGCTCATACC TGTAATCCTA 
4681 GCATTTGAGA GGCTGAGGTG GGCATATCAC CTGAGGTCAG GAGTTTGAGA CCAGGCTGGC 
4 741 CAACATGGTG AAACCCTGTC TCTACTATAA ATATAAAAAT TAGCTGGGTG TGGTGGTGCA 
4 8 01 TGCCTGTAGT CCCAGTTACT AGGGAGGCTG AGGTAGAATT GCTTGAACCT GGGAGCAGGA 
4861 GGTTGCAATG AGTGGAAATC GCACCAGTAC ACTCCAGCCT GGATGACAGA GCAAAATAAT 
4 921 AAATACATAA AATAGATTTA TCAGTTTATC AATAATATAG TTTTCTTTTC TAGGTGTAAA 

4 981 TATAGGTAAT GACTGTCCTT TAGTACATTT TCTCATGATG CTCCTCTTAC TTGGTTTGGT 

5 041 ACAATATTAA GTATTGAAAT AAAATAGAGA ATCCTGTCGC TACACATGAG CACTTATTCC 
5101 ATTTGCTCAT CTCCAATATG CACGGGAAAT TCTCAAATTG CTAATAATCT TGTAACACAC 
5161 ATGCATTATA TTCAACAGGA ATATATAAAT TTATAATTAT AATTTAGGAT CAACAGATGA 
5221 CAAACCTTTA GAAGGTTTGT ATTTAACCTT AAAATATAAT TTTTTAAAAA TTGGTTATAA 
52 81 AATTTCTAAT ACTTTCTTTT TTGTGACCTC AAGGGGAAAA TATAATTCTT ATAAAAGTTC 
5341 AAATGATTTA CAGAATACAA AAAGTGAATA GAGATGATGA ATGAATTAAA GGAAAGGATA 
54 01 TTG CTAC AT A GATTTGGAAA TTTAAAAAGG GAAATTACGA TTGTTGATTT TGTGTTAAAC 
54 61 TGATCTGCTT TGTTCAAGAT ACCTTATGTA CCAAAAAATG ATTTTATCTC AGCCTCATAT 
5521 CTCAGTAAAT TCCTGAGACA AACTTTAGTC CCTGGTGCCC AGGTGCCTTT GGTAATTGGG 
5581 AGACCTCTAG GTTTAGCATC CTCATCCACT CGCCCCAATT TAAATAGTCC TCCCCAGGGC 
5641 CATTCAGGCA AGGGAGATGA AAACTTGCTC AAGAGTTGGA ATCCAATTGA AGCTACCGAA 
57 01 ATTCATTGCT CAATAGATAA TTTTCCCTGG AAGTAACTAG GGCTTTTGAA TATAATAGTG 
5761 GGCATTTCAA AGTAGAAGGT AAAGTATTTT GGAGATGAGG AGACAGGACA GAGCTACGAG 
5821 GAATGTCCTT TGCTCAGGGA CTAGGCTCTT AGCAGTACCT CTTAGGTAAG AACTGGTTAA 
5881 CTGGCACCTT CTGTGTTTCT CTGAAGCTCC CTTTGCTTAG GGACTAGGCT CTTAGCAGTA 
5941 CCTCTTAGGT AAGAACTGGT TAACTGACAC CTTCTATGTG TCTGAAGCTC CCAGAACAAA 

6 001 CTGCCAATGA AATTTGGATT TTTGGAATAT AGTTTCTTTT TTGTTGTTAC TTTTTGTTTT 
6061 GTTGTTTTTT TTTGAGAGTC TCACTCTCAC TGCAACCTCC CCCTCCTATA TTCAAGTGAT 
6121 TCTCTTGCCT CAGCCTCCCG AGTAGCTGGG ACT AC AGG CG TGCACTAGCA TGCCCAGCTA 
6181 ATTTTTGTAT TTTTTAGTAG AGATGGGGTT GGTTTTTTTT TGAGACAGAG TTTCACTTTG 
6241 TCGCCCAGGC TGGAGTGCAG TGGCACGATC TTGGCTCACT ACAACCTCCA CCTCCCGGGG 
63 01 TTCAAGTGAT TCTTCTGCCT CAGTCTCCTG AGTAGCTGGG ACTACAGGCG CCTACAGGTG 
6361 AACACCGCCA CACCTGACTA ATTTGTGTAG TTTTATTAGA GATGGGGTTT CGCCATGTTG 
6421 GCCAGGCTGG TCTCAAACTC CTGACCTCAG GTGATCTACC CACCTCAGCC TCCCCAAGTG 
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6481 CTGGGATTAC AGATGTGAGA CACCAGATCA GCCTCAGAAG ACATTTTCTA TTGGAAAGAG 

6541 AAAACACTAT TAGCAACCTA TTAGTCTAAT ATTTAATACT TAATGTCTTC CTTAGTAATA 

6601 AACCAACTCT CTACAACAAA GTGCTTCCTG GCTGCCTAGT CATTGATTCA TTCAGTTCAA 

6661 CATTTTCTCA ATGCCCAACA GCCAAGTGTC TCCTGTATGC CAAGTTCTAT GCTGATTATC 

6721 AGTATTTGAA TAAGAGGGGG TCTACATCTT AAGTACTGCT TAAGATGAAA GCCTCTAGGT 

6781 TAACAAACTT AACACAATGT ATCATTCACT ACTAAATAGA CCGAATACAA AATCTTGTTA 

6841 TTGGAGCCCA GAGAGAAGAA TTGAAATTCA AGTTTTCTCT CTCTCCTTTT CTCACTCACC 

6901 ACAATAAGTC AGTTGCACCA AGTCTTGTAG CTCTTTACTG AGCCATGTTT TCACGTGTCC 

6961 CTTTGTTTTA TTTGCCACAC CCTAAATAAA AATTGTACTG GCTTTTTTTC CCTGGGTTTA 

7021 CAGTATTAAT ACATTGTCAA GATTTACCTC TTCGTGTAGA TTCCCTGGGG AAAATTACCT 

7081 TTCCTCCTTC CCTTAAATTC TTCAGAGGTT AG AAAG C CAT TAGTAACATT CTGGTATGTG 

7141 GACAAAGTTT ACCCATTATG TATGGATGTT TTACTCTTTC CATTTTTCTG ACAATAATCT 

72 01 CTTAAGGAGG TGTGGTTATA GAATAGTCAG CTGTTATAAG TACTGTTTTC CTGGCCTTAC 
7261 AACTTAAATT CTTTAAGCTG TTTCTTAGTT TGCTCATCTC AAAATTCGGA ATAAGGATAA 
7321 AACCTATCTC TTAGATTGTT GGATTAAATG AATTAACATA CTGGAAGCTC ATGAAATGTG 

73 81 CCTGGCACAC AGTAGTGCCT AATAAACCAT CTCTCTTATT CAGCCTGTTT TCTGATTTCA 
7441 GAATCTACAC TTGCTGAGCC AGGTTCTTTT CATTTCAAGG TGAGCAAAAG CATACAAGGA 
7501 AGAGATGGAG GTAGGAAGAG ATTAAGCCCT AGGCCAAGGG AGCTGGAATC AAAGGCAATT 
7 561 TGGTCAGTGA ATAAAAAGGA TTCCAAGGCC CATAAGGCAA TTCTAACCTT AGGATCGAAA 
7621 TTCTCGGACA TACAGGAAAT GCTGGGGGGG GGAAAATCCG GTCTTCTCAG CCCAAGAGCC 
76 81 ATGTG AAAC C AGACCTTCAA ATCTGATGAT TCTCAGCCCA GCTGCCCATT AGAATCGTTG 
7741 TAATTTAAAA ATACCCTCGG AAAATTCTAA TATGTGGCTA TCAAAGGTGA TCATTTGCTT 
7801 TTATGCCACT TTGTTTTCAC CCAAATGGGA CATCCAACCC TTTTCCTTTG AGAGTAGTTG 
78 61 TAGGGAAAGG AGGGGGTGGA GGGAGGGAAG AGCGGAAAAG GCTGGATCCG CCCCGAGCCG 
7 921 GTGTCAGTAT CTGGGAAGTG GGAGGCGCGT CAGCAGTAAA CAGCTTCTGC TAGGATTATT 

7 981 ATCTCCTGCC ACACACTCGG ATTTGAAGGC TCCAAACGAA ACAATGCAAA ACGCTTCAGT 

8 041 GGAGTTCCAG AAGCGTTAGA CTAAACGACT GGGTCTGTTT GGCCAGTCTG AGCAGCTGGG 
8101 CGCAGATGCA TAGGCAAGAC TTAGCCCGCC TAGACTTTTC TGCCCACTTA ATTCCGATCA 
8161 AAGCAGAAAC CGGCCGGGCG CGGTGGCTCA CGCCTGTAAT CCCAGCACTT TGGTAGGCAG 
8221 AGGCTGGCGG ATC AC CTG AG GTCAGGAGTT CGAGACCAGC CCGGCTAACC TGGTGAAACT 
8281 CCGTTTCTAC TGGTGGCGGG CGCTTGTAAT CCCATCTACT AGGGAGGCTG AGGCCGGAGA 
8341 GTCGTCTGAA CCCGGGAGGC GGAGTTTGTA TGCAGTGAGC CGAGATCGCG CCACTGCATT 
8401 CCAGCTTGGG CAACAGGAGC AAAACTCCGT TTCAAAAAAG CAAGCAAACA AACAAAAAAA 
8461 TGCAGAAACC GAGATCCGGA AGAAAACCTC GGCGAGATTC ACAGAATCCA GGAAAATAG G 
8 521 TCTCTAGAAA TTTGTCCATG GTCCCAGATC TCCATTTCTT GTGGGTGGGG CAGCTGTTAC 
8 581 CAGATCCCTA GAAGCAAAGG TTTTTTTGGG GGACCGTGTC TCACTGTTGC CCAGGCTGGA 
8641 GGGCAGTGGC ACGATCTCGG CTTACTACAA CCTCCGCCTC CCAGGCTCAA GCGACTCTCC 
8701 TGCGTCAGCT TCAAGAGTAG CTGGGAGTAC AAGGTATGTG CCACCACGCC CAACTTATTT 
8761 TTTTATTTAT TATTTTTATT TAGTAGAGAG GTGTTTCACC ATGTTGGCCA GGTTAGTGTC 
8821 GAAGTCGTGA CCTCAGGTGA TCAGCCCCCT CGGCCTCCCA AAGTGGTAGG ATTAGAGGGG 
8881 TGAGCAGAAA GCAAAGGTTT TTGAGTGGCC ACAGGCCCCA CTCTATTTCC TTTTCTGCCT 
8 941 GTAATGGCAA CCTAGACGCT TGAGCTTCTT AAAATACAAG AGTAAGTTGC ATGTCAGGCA 
9001 CCGTTCTACA TTAGGGACAT TAGTCTGTTT T AC AG AC AC C TTTCAACTCC CTGGTTAACT 
9061 TTTAGGTAAT ATACTCTG C A CTTTAGCAGG AATGGAACCT ATAACTCTCA CAGAATTAGG 
9121 AAAGTGAGGC TGCCTACAGC CTAAATTGAG AAAAAAATAG ACGGGGGACT AGTCGGAGGA 
9181 CCAAACAAGG TTACCAACAC GTTAGAGTTT TGCCTTCAAT TTACATTTTT AAAGTAATCA 
9241 CAACGAAGTG TTTAGATCAC GAGGCATCCC TGCATGTAAA CTGTTAGGCA CTAACTATGG 
9301 TCGATCTTAC AAAGCATTAA CTAGAATATT TCTTTAGAGT ATGATAGTAC GTAACTGACC 

93 61 TACTATTACA TACAAACAGA CCAACCTTTA GTAACAGCGC TCCCCAAAAA CCGAAAAGCA 
9421 GTAATACGCT TTGCTCAAGG TTGGCATAAA ATTAACTTAC CTTAGTG CCT TTTTTCCTTC 

94 81 TACCTACAAG CAGTGAGGTT AGCTCTTCCT TTGAAACGGT AGGGGGGCTC TGAAAAGAGC 
9541 CTTTGGGTTT GATAGCGTTT CCGGGAGCTC AGATACCTGT CAAATCACTT GCCCTTGGCC 
9601 TTGTGGTGAC TCTCGGTCTT CTTAGGCAGA AGCACGGCCT GGATGTTAGG AAGGACGCCG 
9661 CCCTGAGCAA TGGTCACCCG GCCTAGCAGT TTGTTGAGCT CCTCGTCGTT GCGGATGGCC 
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GAGAGGACAG 

TGTTGATATT 

TGGGATTTGA 

AGTCCAAATA 

TAAAATGGCT 

AAAAAGGAAA 

C AG CTTATTA 

TTAGAGGACA 

AGAGTGCTTT 

TATTGTAAGG 

CAGAATTATT 

ACTTTTATTT 

CACTTTAAAT 

TATCTTTTTT 

CACAATCTCG 

CTCCCCAGTA 

TAGAGATGGG 

CCTCCCAAAG 

TTATGAATTT 

TCTTAAATTT 

TTTGTCTAAA 

CTAATTAAGA 

AAGTATTTTG 

TTTTATTAGG 

GACAAATTGG 

TAATATTAG C 

ACAAAAGCTA 

AGTTAACATC 

TGATAATAGA 

ATTCAATGTG 

GAATATTATT 

TTGGAGGGTA 

ACACAGTTTA 

ATACTGTTGC 

AGG AC C ATG.C 

TTGCCTGCTT 

TAGCTACAGA 

TTAAAAAGTT 

AAAATAAGAC 

ATGTAATAAC 

CTTCAAACAA 

AAACACATTT 

ATTAAAAGAC 

GACGAGGTCT 

CCTTTTCTCC 

AAAAAACAAG 

TTTTGCAGGT 

TTACTAGAAA 

AAATGAACTT 

AAGCCAATTA 

TACCCTCATA 

TGTTGTTACA 

TTAGACTTGC 

GCCAGAAATC 



GAAATCTGGT 

CTCTGTGAGG 

TGTTTTGTGC 

GTTGTCGATA 

GCCCTGTTAT 

TCAAGGAAAC 

TTAATTTTAG 

GAAGAAACAT 

CAATATCTGA 

GATGTGATGC 

CATATTCTCA 

CTTTAACTGA 

T CTGTTCT AT 

TTTTTTTTGA 

GCTCACTGCA 

GCTGGGATTA 

GTTTCGCCAT 

TGATGAGATT 

AAATAATTGT 

TAGTTGGCTT 

AAAAAATCAA 

GAAAAAAAGT 

TAAAAAAAAT 

TTTTTTTAAT 

CTTAATAATT 

AGAATATTAT 

ATTTAACTTG 

ACTTTATTTA 

TAATGT CATT 

TGAGCTTAAG 

AAATTGAGTA 

CAAAATACAA 

GAATAACCAT 

TTTCGCCACT 

AGGTTTTGGA 

TGTTTAAGGG 

GAAACACAAG 

GTTACTGTTT 

TTCAATCTTT 

CAATCTTCTC 

CAAATACTGC 

GGAATCTCAC 

CTCCAGAATT 

GAAATAGACA 

ATTATCTGTC 

CAAATAAACA 

TTGTAACAAG 

ATTTATTTCT 

GTTTAACTAA 

AATTCTTGGA 

ACTTTTTTTT 

AAGCCATTGT 

TCCTTTATGA 

GTGAAGACAT 



TTATTTATTA 

ACACAGGGTT 

TTGTATGCCT 

TCTGCAAAAC 

AACTTTTGAC 

CAAATGTCTG 

TAATTTCACA 

AATGTTGTTA 

ATAAAACAAA 

TGGAAACTAG 

GCAGTGGTGC 

TCAACATGCT 

TAGCACGGTT 

GACAGAATTT 

ACCTCTGCCT 

CAGGTGCACC 

GTTGGCCAAA 

ACAGGCGTGA 

GAAATTATCC 

ACATAAAGAC 

AAATTTTCCT 

TTAACTGTGA 

ACTTCACAAT 

AAGGAAAATA 

TCATTTTAAA 

AGTATACACA 

CATTTACTAA 

TTATTCTAAA 

TTTAAAAATG 

TACTGAGTTC 

AATTAATTCT 

ATCACAAGAA 

TGATAAACAG 

TTAGATTTGT 

TGACTGCCTC 

CTATGGTTAA 

TAAGCATTCG 

GTTAATGTGG 

TTCTTATTTT 

TGACAACATT 

TTTTATACTT 

TGAGAAATAC 

CTGGAAGTAG 

GCTTCTTCCT 

TTTCCAGTGA 

AATCTCAGTT 

G AC CTTTAT A 

GCCTGTGGCC 

AGTTGGCCAA 

GACAATTTGT 

TGCCCTACTT 

CAAAAAAACA 

GATATTTTTA 

GGCCTACCTA 



ACCTTTTTTT 

AGAGTTGGTG 

CTTTCCACCT 

CAGTATTCCT 

TTTAAGAAAG 

GTCTCAATAA 

TTATTGCCCC 

CAAATTGGAC 

GATTTAATAT 

GAAACTAGAA 

CACCTGAGGG 

AAATAGATAA 

AGCTTTCCTA 

TGCTCTGTGG 

CCAGGGTTCT 

ACCACGCCTG 

CTGGTCTCGA 

GCCACCGTGC 

ACTTAAGGGA 

TTAAAATACA 

TGTGCTTTAA 

GTTTCATTAG 

TTTTAAATAA 

TATAATACAT 

AATGGCTTCT 

AGTTTAGGGT 

ATTTCTTCCA 

ATTGTAAATT 

GAATTAAATT 

ACAGTGTATG 

CAATCTTTGG 

ACAGTGTAGT 

ATAAGAGAAC 

AAATCATGTA 

TGTTTTCGTC 

TCCAAACAGC 

AGATAATGAC 

TACATTCAAT 

TATATAGCCA 

ATAACAATGC 

CAGAGCAGAT 

ACTATCACTA 

GAAGTTTCCT 

TCTTTTACCT 

TGAAATTTTG 

ATATTTTACT 

ACTTGACTAA 

CACATTTGAG 

ACTGATCTTT 

ACTTTAAGGA 

CTGTGCTTCT 

AAAAACAAAA 

CCAAAAATGG 

ACTTGGAAAT 



GGGGTGTTTT 

TTTTTCTTTC 

TCCAAAACTT 

GTGTTAAGAT 

TGTTAGGACT 

CTGCTATGGC 

TTCACGTTCT 

TATTGAGTCA 

TTTCTAAACC 

TTTTCTTCTA 

ACTTCTGATC 

CCTATGGCTC 

ATTGGCAATA 

CCCAGGCTGG 

AGCAATTTTC 

GCTAATTTGT 

ACTCAGGTGA 

C C AG AAAAG A 

ATTAATAAAT 

TCAATTTAAA 

ATGTGCTACC 

TGGTCTTAGT 

CTTAAAAATA 

CTAATCAAGA 

TTATTCTTAT 

TCATATTCTA 

CTAGTTGTAC 

ATTCATTGAA 

TTATGTTACT 

ATAACTTTAA 

ATACCTGGAC 

TTTATGCAAA 

ATATGATTGC 

CTGTATACGT 

ATGCCTATGC 

TCTGACTCTA 

TACCTTGAGC 

TTACTATGGA 

TGATTTATAT 

TGGAACCTCC 

GGATATGTGC 

AAAATACAGT 

CTTCAAAGTC 

GTGGTATTAT 

ATCTGGCCCT 

AAGATATTGG 

AAGTTCCTAA 

TCAAAATAAT 

GAGACCTATT 

ATTCTTATAA 

CTAATATGCA 

AACTAAACAA 

AGGAGTTGAA 

GTTGGTTGTC 



TGTTTGAAGA 

TGACTTTACA 

GTCTTTTTTG 

GATATGAATA 

AACAGGAGAC 

AGAGGCTCTA 

TTAAGTAAGG 

GGAAAAAAAA 

TTAACGAGTT 

AACTGAGAAT 

TTAATTACAT 

TGTTTTTACC 

AGATTGAGAC 

GGTGCAGTGG 

CTGCCTCAGC 

GCATTTTTAG 

TCCACCTCGG 

CTATCTTATT 

TATAATGTAA 

TAAAAACTCA 

TCTTTAAGTT 

TAACAGCTTA 

TTAATACCTC 

TTATTTTTTG 

ACTGTAAAAA 

AAAAACAAAA 

TGGTTACATG 

CCAAATTAAA 

AATTATAAG G 

GAATTTAGGT 

AATTTCTAAA 

TAACATTTTT 

CTTAGAATAG 

GTGGGCGTAG 

GGGAACACAA 

TCAAGTACTA 

CTTTACTTAT 

TTGTCACTCT 

TCATATCTTA 

ATTTTCAGTA 

TTCCCAGTGT 

TCTGAGATTC 

TACAGAGGAA 

TCTGTTTTGT 

CCCAAGTATT 

CATGCTAACT 

ATAAGAATAT 

CAATTAGGAA 

CATCTAAGAC 

TATTTGTAAT 

GATTATTAAA 

ACTCACATGG 

AAACTCTGGT 

AGTGGAAAAT 
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ACTACACAGA 
AATTGTTTCT 
GAGTATCCAT 
TAGACAGTTT 
TTGTTGAGTG 
AAAATTAAAT 
GTAATAATTG 
CACTAGTTCC 
ACAAAAAGTG 
GATGTGGGAC 
CTTTACAGTG 
ATGCATATAA 
GCTCCTAAAA 
TAAATTATAA 
TACCGGGGTC 
CTCTCTAGGC 
CAGATTTGGA 
AAAAGACATT 
CAAGCATAAA 
GGCTCTGAGT 
GTTCATTATA 
CTTGACCTCA 
TGGAGTGTAG 
CTCCTGCCTC 
TAATTTTTTA 
AAACTCCTGG 
GTGAGTCACT 
GGGACTTTGG 
AGAATAATTA 
GATCTCCTTG 
CTATGAGGAA 
TTAAAATTCT 
TCAAGAGAAA 
CCTTCCTTTA 
CAAACTCAAC 
GTTAGACTTG 
CATTTGAGCT 
TGTTTAAAAA 
CTAGCCTCAA 
GTTTAGAGAG 
CTTCGTCGCT 
TTATCCTTCA 
TTAAGAGCGT 
CCAACAGCTG 
CGCAGGGAGA 
TCTATTGTGT 
GCCATCATCA 
AGAAGTGAAT 
TCTTTTCCAA 
TGAGTGATCC 
ACACCTGGCT 
ATAGAAATTT 
ATTCTTTTTA 
GTATCATATA 



GATAGCCATA 
AGAGAATCAC 
ACTAAACTCT 
GTAGTTTTTT 
AAATCAGTCC 
GTCTTTCAGT 
CCCTACTCAT 
AGACGTGGTA 
AAGCTTCTGG 
TCTGAGGCAG 
ATGGTAATAG 
ACCACTGTGT 
GGACTTGAAG 
GAATTTCATA 
CAACAGGTTG 
ATATTCCTAA 
AGGATATATA 
AAAAATTAGT 
ATTAAATTGA 
AATTTCTTTG 
ATTAAGAAAA 
GTTCTTTTTT 
TGGCGCAATC 
AGCATCCTGA 
AAAAGTTTTT 
GCTTAAGTGA 
GTACCCCGCC 
TTTGCTGATT 
ATAGAGACAT 
CTGCTGGCTC 
ATAGACCTAT 
AGGCTTATTC 
TATGAATAAA 
TTTTCTTGTC 
TGTAGGCTAG 
CTTAAACAAT 
TCAGTGCACT 
ATCTGCAGAG 
GAGTGGATCA 
TGTGCTCAGG 
GTATCTTCTT 
AGTTTAGATC 
ACAGACATTC 
TGCTACCTGG 
ATGACAGTAG 
AAAGTGCTTA 
TTATCATTGT 
CAATCACTCT 
CAGTCGTCAC 
TAGAAGAAGA 
GAGAAAAATT 
ATGACACAGG 
TATGTATATT 
TAAAATAAAT 



GTGCTGCACA 
TAATTGTTTT 
TTTCTACTGA 
TCTCCCATTT 
ATTGCTTGAT 
CACAGTTTGA 
AAAGATGGGG 
TCATGCTAGT 
AGACAGACTC 
GTCATTTAAT 
CACCTACCTT 
TTACTGCTGT 
CAGCTTATGA 
AATTATTTGA 
AGAAAAAATA 
GGACTTAAAG 
TATTCAGCAC 
GAAACTTTTC 
GTAGAGTATA 
GGGTCTGAAG 
AGGGAGTAAA 
TCAGAGACAG 
GCATCTCATT 
GTATCTGGAA 
TGTAGAGATG 
TCCTCCTGCC 
CCACTTCAGT 
TAAAGATTCA 
CTGGTCTCAT 
AGAAGGGTAA 
GTAGAGGAGG 
TCTGACCATA 
CTTTTGTTTT 
CTTAGTTTTC 
AACAAAAAAA 
TGGGGTAATG 
GAAATAAATA 
AACAATACAC 
AAGATGCTCA 
GTTCTAGGCT 
TATGAAAAAC 
AAATGGAACT 
AAGGGCTAGA 
GAAACTTAAC 
GTATCTCATA 
AAACACTGCC 
TCTCAGAGTC 
CTCTCTTTTC 
TGCTGGACAC 
TAAATGGAGG 
AGCTCTTTTT 
AAACATAAAG 
ATATATACTC 
TTAGGTGTCA 



GCCAATCTTA 
CTTTTAACAT 
AAATAATGTG 
C T ATTTTAT A 
ATACCTTGAG 
CAAACTCAAC 
TGAAGATTAA 
AAAATGGCTG 
CAAGTTTGAC 
CTCTCTGTGC 
CTAGAAGTAT 
TTGACAAATT 
CTGAAGACTT 
TATGAAAATG 
CACTTTTTTT 
AATGATAACT 
ATTGACAGAC 
CTACCTTTAG 
CCACTGTAAC 
ATCAGTTTGA 
TCTGGAGAAT 
GGTCTCACTT 
GTAACCTCCA 
CCACAGCAGG 
GGGTCTTACT 
TCAGCCTCCC 
TCTGAGGAGG 
TGTAACCTTA 
GTTTCTACAG 
AAGAGCAGAA 
CTACCTGTGG 
TCAAGTTTTC 
CACTTTTCTC 
TTTTCACTTT 
AATTGAAAAT 
AACCTTGGAC 
TATTTTTAAC 
GTTGTGAGAT 
GCAGGCAACA 
CTAAAAATCA 
ACTAAGTCTT 
TTAGGACACT 
GGATGTGGGT 
CTCTCTGTGC 
AGGTTGTTGG 
TGGCACAGAG 
AAATACAATA 
TCCAGGGGGA 
TGTTTCATCT 
TATTTTGAAC 
TCTATGCATA 
ACAAAATTAA 
ATATTCATAT 
TGATATATAT 



AGTGTTTCTA 
TCTTGGTTTA 
CAAACATAAC 
AATCATCTTT 
CACAAGTAAA 
TACCCTGAGC 
ATGAAATAGC 
CACAGCACTG 
TCCCAGATCA 
ATTAGTATCC 
GTGAAGATTA 
TTATTTATAA 
TGGTAGGAGT 
CCAGTTGATC 
CCCTGAACAT 
ATCATTTCTC 
AATCCCAGTA 
CCTGTGTAAT 
ATTTC CTGAA 
CATATCCTCA 
GAGCCACTTT 
TGTTGCCCAG 
CCTTCTGGGC 
TGCACACCAC 
ATGTTGCCCA 
AAATTGTTGG 
AAAAAATATG 
TCATCCAATG 
TTGCTCATGC 
ATGATGGGGC 
TAAAACCTTA 
AAATGGTAAA 
CCTCCTCTCC 
TTTGTCTACT 
TAAAATGTGC 
ACTAGATTTT 
AATTAAAAAA 
CTTGAATGGA 
G AGTAAGAG C 
GACAGTCCCC 
TTTCCTCACT 
GACTAGGTTA 
TTACTGCACA 
CTTAATTTCC 
AACAACTAAA 
CAAACATCCA 
TCTCATATCT 
GACAACAGCT 
TGCAAATAAA 
AATCAAAGAA 
AAACTATTAA 
AATAACTCCT 
ATACATATAT 
TTAGATAAAT 



GAGAATCACT 
TACAAGAAGA 
ATCCTATTCC 
TTAAAATACT 
TAGTATG CCA 
CTATAGAGTG 
ACCTATAGAA 
CTCAATGATG 
CCACATATAA 
TTCTCTATAC 
AAGATCCTTA 
CCATCTTTAC 
TGGCCTTCTA 
ATAGTATGTT 
ATGAAATTAG 
TTAAATCTTC 
GTCCTAAATT 
CCTGGATGAC 
AGGTATTCTA 
AGTATCATGA 
CTTACTACTC 
GCTGCCAGGC 
TGAAGCCATC 
CATGCCAAGC 
GGCTGGTCTC 
GATTACTAGT 
TAATAATAAT 
CGCAATTTGT 
CTTGATAGTA 
TTCTCTCATT 
TCCTCATCAC 
AGAATTGGAT 
CCCCATTCTC 
ATTATTTGCC 
CCCTTTTGTT 
AAAACACACA 
TAAAATTGCA 
AGGAAAACTG 
ATGTTGGAGG 
ACGGCCTGGC 
GGATAAATTT 
CATTCATCTT 
GGCTCATTAT 
TCATCTATAA 
TGCATTGGTA 
GTGAACTTTA 
GATAAATTAC 
TTTAGACATA 
CCAATGAAAA 
GGACAAATGA 
AATATTCTTC 
AGTATCTCCT 
CTCACATCAT 
ATACTTAGAA 



Figure 9 (Page 6 of 74) 



WO 98/14466 



PCTAJS97/17658 



95/162 

19441 ACTTTTTTAT GGATGTATAA TTTATGGATA TATTGATAAT TATGTATTTG TTATTGACTA 

19501 CTTCAATTGA TTCCCATTTT TATGCATTAT ATTATAGATT ATATAGCTCA CACATCTTTG 

19561 TACATAAATC TTTGTTCAAA TATTATTTCC TAAGGATAGA CTTCATGAAG TGGAAATACT 

19621 AAATCAAAAG TGAAAAACAT TTTCTAAGGT TCTTAACATA TACATTGCCA AATTGCTATT 

19681 CAGGATCATA C CAATTTAT A ATCCCAAAAT AATATGAAAA TTCCTGTTTT ATAGCACTCA 

19741 TATTTACAAT AAATTTTAAA AATCACTGTT AACCTAATAG TCCTTCAAAA GAAAAAAAAA 

198 01 TTGAAATTAC ATTATTTTAA TGACTCTATT AGTGAGGGTC ATTCTTCCCA TGTTTCTTGT 

19861 TAGCCATGAC CCTATAAGAA ATAAACTG C A CTGCAAAATG ATAAACATGA TATCAATCAT 

19921 TACATGGGAA GGCACTATAT AAAGAATAAT ACCTTAGGTT AAGGCCACAT AAATATTTAT 

19981 CAGGTGCCTT TTCTGCGGAG GACTCTGAAG GGATACTAAA CTGCATTTAG CTGCATGCAA 

2 0041 CTGAAATTAC TTTTACCTAC ATTGTCTCTT ATAAACATTA TAACTACTCT TTGAGAAAGT 

20101 GTTTACTATG GACTGAATTG TCTCCCCATC CCCCCAAATT CATATATTGA AGCCATAAAC 

20161 CCCAATATGA CTCTATTCCT AGACAGGACT TATAAGAGGT AATTAAGGTT AAATGAGGTC 

20221 ATTAGGATGG GTTCCTAACT GGATAGGATT GGTGGCCTTA TAAGAAGAGG AAGATTCTGC 

20281 ACTTGGTCTT CCAAATTAAA TAATTTATTT AAAAGAAAAA AAAAAAAAGA GGAAGAGAGG 

2 0341 GAGCTCTGCA CATATACTGA GGAAAGGCTA TGTGAGCTCT CACAGTGAGA AGGTAGCACT 

20401 CTACAAGCCA GCAAGAGAGC CCTCACCAGA ATC CAGC CAT GCTATACCCT GCTCTGAGAC 

20461 TTCCAGCCTC CAGAACTGTG ATAAAATTTT GTTGTTTAAA CCACACAATC TATGGTATTT 

2 0521 TTTTATGGCA GCCCAAGCCA ACAAAGACAG CATCATTGCT GTCACTTACA GACAAGAAAA 

20581 CTAAGACTAG GAGAGAGAAA AGTTAAACTT GTCCAAGGTC ACAAAAGCCA GAAACAAGTG 

20641 AGGTGAGAAG TTG AC CTTGT TCTCCTCAAT CCAAGGCCAG GACTCCTCCA CTCCACATGT 

2 0701 AGATAGCCAC CTCACAGTCA ACAGCCAAAT GTCCACACCC CAGAGTCAGC ATTAGACCAA 

20761 GATGTCTTAC CAGGAGACAA ATGCCTCATC TTGAATAAAT ATGTTCTAAC AACTTACCCA 

2 0821 TGTAAAACAT TGAATCTCAT GAGAAACAAA AATGCAAAGT ATGTAGAAAA CTATGTTTAC 

2 0881 CACTTAACTG ACAGTGATAA AAAGCTTAAT GATATCCTTA TAGTCTTGGA GGGGTTTGTA 

2 0941 TATGTGGTGA AACAGGTGCT CACGCACTGC TGATAGACTG TAAATTGGTC CTAGAGAGAA 

21001 AAATAAATAA ACTGGAAGGA GTTATG CTGT ATGTTTACTT TTTTTATGGA AACATATGAT 

21061 ATACCTGGAA ATTCGATTGG CCATGCATCT ATTTCTTCAA TGGGTATGCA CAGTTGAGCT 

21121 GTTCCCATGC ACCAGGCACT GTAATGGGAC AACTGCACAT GACAGTCAAA AATCTCAGTC 

21181 TCATGAAGTC GACATGCTCA TGGAGAGGTG CTACCCACTA AACTAATATT TGTATATCAA 

21241 TTATGGATAC ATTGGGCCAC ATTTACAGAA ATTCACTTAC AGTGGGTTAC CAGAAGGGAT 

213 01 TTTTTTTCTT GATTGGCAAG AAGGCTAGGC TGTTTTGTTG GGGGCTGGCA GGAGCTGTCT 

21361 AGGCTGCCCA AGTATGCAGG TCTCTTCTAT CATCCTGTGT TAACCATCTT CCATGTATCT 

21421 TTCAACCTCA TGGTCATCTG CAGCATGTCT AGGGGTCATA TCTATGTTCC ATGCAGGAAA 

21481 AAAGGGTAAA GGGAAAGGGA AGTAGGCATG TACCATTTTA ATGCACACCT TGGTTTTCAG 

21541 AAAATTTAAG AAGAAAGACT TTCTGCTTTT CTCTGACTAT TCTGTATTCT GGATTACAAC 

21601 G C AAC AG AAA CGTCACCTTA AATT CTAATG TTTTTCTCTC CTTGCTTTCA AAAACTGACT 

21661 CATTAACCTC CACGTGGCTT GGAAAAATTA TTTCAGTCAT CCAGTAATGA GCTGTTCATA 

21721 GAAATGTTTT GGACATCAAG TCTGTGTTGT TAGCATTATA CATGTTAAGC ATTGAATAAA 

21781 AAACAACATG ATGTGGGTAC ATTTCTTTAC TTACATATAA GTACTTATAT ACTTATAGCT 

21841 GAAAAGAGAG GTTGAAATGT CAGGTGGAAC AGAAATAAGA TTAC CTAGAT GTTTCTCCTA 

21901 TGGGTGATTT TCAGCTATGC TGATCTTTCT TCTGGG TCAG GTACTCCCAG AACTTCCTAA 

21961 TTAAATGGTG GCCCTGATCT TAGTTCCTCT CTCCTCTTAG ACATTTTCCA GGACTACAGA 

22021 AGATGTGCAG TTTATAAATG AGTAGCAGAA ACCTACTGAA CAAATTATTC AGGCTCATCT 

22 081 GAACAGAGAG GACACCTTCT CTGCTATACT CTCTCAGTGA TTTCCCTGCC TTGGGGTCAA 

22141 TTATTGTCTT GGACATTGAT TTAAGCACAT AATAATTGTT GTCATTGCTT ATGTTTGGAT 

2 2201 TTCATCTCCC AAAATAGATG GTAAATTCTT TAGTTTAGAG ACCAAGTAAT ACTTACAAAA 

22261 AAATTTTGTG TGTGTGTGTG TGTTTTTTCT GTGTCTCTCA GCCCTGTAAT AGCATCGTAC 

22321 TTACACTTGT TAGATTTTTA GAGACAACTT TTACAAAACA TGGAATTATC T AC AT AC C CT 

22381 TTCTACAAAA CAGACAAATT AAATACTCAG TAGTTGAACC AAAAAAAGCA GTTCAAATAA 

22441 AATACTTGAA AATGAAGAAA TCATTTGAAC AGAGTTAAAG TTAATCGTAA AATAATGTCT 

22501 GTAAAAATTA TTGCCAATCA AATATAAAGT TCAAAAATAG TGCTTGAAAA AGGAAGAATC 

22561 ATATGAAAAG GGACTACTCA TTTTAAAAAT GTTAGATATC AGGAAAAGCC AAGAAGTGAG 

22621 TATGGTAAGA GTGCTGTCAA GTGAAACCCT GCTAATCTCA CTGAACATGT AAAAATCTGT 
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22681 AGATGCCTTT ATTTTATTCA CTCACACACA TATGTAGAAA GAGAAATATA TGGTAAACAT 

22741 TAAAAAAAAC AAATTAGAAT GTAAAATTAA TACTTTAAAA AATGGGCTGT ATACTTTTCT 

22801 TATCACCGGA GATAAGAATT TATTATTTTT AAAATAAAGT TATTTTCTCT GTGACTGTTT 

22861 CCATGACTTT GCTACTTAGA AGTTAGAGAT GCCAAAGTTT ATCTAAGAAA ATGTTTATGG 

22921 AAATATTATT TCAATAATGA ATGTTTAGAA GACTGAATTT CCTGACTGGG CACAGTGGCT 

22981 CATGCCTGTA ATCCCAGCAC TTTGAGAGGC TGAAGAAGGA GGATCG CTTG AGTCCGGGAG 

2 3 041 TTCAAGAGCA TCCTGGGCAA CACAGCGAGA CCCTGCAGCA AAGTAAAAAG AAAAAAGAAT 

23101 TGAAAAAGGA AGACTGAATT TCCTTTGGGC AAGTCATGTG ACATTCCTGT GCCTCAGTTT 

2 3161 CTTCATCTAT AAAGTTAATT CCTACATTTT TGGGGAAGGG AGAGAAAAAC TTAGGATAGT 

2 3 221 GACTGGCACA GAAGAAGCAC TATATACTAT ATATATGTGG ATATCATTTG TTTTTATGGT 

23281 ACCATTTTAG CTATCTAATG CAAAATATGA ATCTTTTTTT TCTGGGTCTT AAATTATGGA 

2 3341 ATGTAAGAAT TTTCTAAATT CTCTAATTCT GTGTTAGTTT TAAAGCAATG GAGTAACGTA 

23401 TCTGTCAACT TGTAAATATA AGGATCAACC TGATCCACAA TTTGACCCCT AGCCACTAAT 

2 3461 ATTTAATAGT ACAACACTCA GAAATTATCA AAGGTCAGAG AAGCCAAACA AATGTAAAAA 

2 3 521 CATACAGGTG CTCAGAAAGA TGCACCTGTA ATCTCTCTAA GGAGAAATAT TTTCCAAACT 

2 3 581 GAGTGACACG GTGCTTTAGT GAGTTGTGGA ATCAATCTCA TGATTTCCAA CCTAGTGTTC 

23641 TTTTAAAAAT GAACTAGTCC ACAGTAGAAT ATACTAAAGT GCTGGTGCTT AAGATAGTAT 

2 3701 TGTTTTCTGG AAAAAAAAAA AAAATTTTTT TTTTTTGAGA CAGGGTCTCG CTCTTGCCCA 

23761 GGCTGAAGTG CAGTGG C AC A ATCATGCTCA CTGCAGCCTT GACCTCCTGG GCCCAAGTGA 

23821 TTCTCCCACC TCAGCCTTTT GAGTAACTGG GACCACAGGT ACGTGCCACC ACACCCGGGT 

2 3881 AATTTTTTAA T TGT AG AG AC AGGGTCTTGC TATGTGCTTA GGCTGGCCTT GTGAACTCCT 

2 3 941 GGGCTCTAGT GATCCACTAG CCTCAGCCTC CCAAATTTAT GGGATTATAG GCATGAGCCA 

24 001 CCCTACCTGG CCTGTTCCCT GAATTTTTTT TTCTTTCAGG TGTTTGTGCA TATGTGTGTG 

24 061 TGTATGGGTA TAACAGAGAG ACAGAGAGAA AGAAACTTTT CTATCACACT TTGCAATCAG 

24121 AAGTTTGAAG TCTTATCTTT TGGCTTTTGT TTCAGAAATA TTTCAAATGT AGACTCTCTC 

24181 CTTTACCACA CTGTCCCCTT AGGCAAGGTC TTTGCCATTC TTCTGAGACT ATTGCAACAG 

24241 ACTCCCAACT TCTGACTGTG GGCCCTTCTC AAAAATGATT GTTTATGCAA TAAATCTAAA 

24301 CCCAAGACAA CTACAACAAT ACAACAAATT CTCTGCTTAA AAACTTCCAA TGTCTGCCGG 

24 361 GCGCGGCGGC TCACGCATGT ATTCCCAGCA CTTTGGAGGC AGAGGCGGGC AGATCACTTG 

24421 AGGTGGGGAG TTCGAGACTA GCCTGGCCAA CATGATGAAA CCCCATCTCT ACTAAAAATA 

244 81 CAAAAAATTA GCCAGGCATG GTGGTGGGCG CCTATAATCC CAGCTAATTG GGAGGCTGAG 

2 4 541 GCAGGAGAAT TGCCTGAACC TGGGAGGTGG AGGTTGCACT GAGCCAAGAT CACACCATTG 

24601 CACTCCAGCC TGGGCAACAA GAGCAAAACT CTGTCTCAAA CCAAACCAAA ACAAAACTTC 

24 661 TAATATCTAC CAAATGTTTC ACACAAGTAT TTGGGGATCT TCACAAATGG CCCTTATGGA 

24 721 GTTTTCCTTT GCTGAGACCC TATGCTCTGG C C AC AC T AAA CTCATTCAGC ATC CCAGAAA 

24 781 GGCCTCAGCC TTTGTGAGCA AGCTCTTATC TCCAGGCCTC TCACAAAGAC CTGTTCCAGT 

24 841 AGAAGCTCAG GGGAGCACAC TGGACATTAT TCCAACAACC CTTTCCCCAC AGCTATGCAG 

24 901 CCAAATCTGC CAGCTCAGTT AATTAATTAA GCAATTCAGA GATGAGGGTC TGCCCAGGCT 

24 961 GGAGTGCAGT AGCTGCGACC TCAAGCTCCT GGGCTCTAAG TGATCCTCTT CAGTCTACCC 

25021 AGAAGCTGGG ACTGCAGGCA TGTGCCACCA CACCCAGCTA ATTTTTTTTT TTTTCAGTAG 

25081 GGACCAGGCC AACCTAGTCT TGAACTCCTG GCCTCCAGCC TTCCGAAGTG CTGTAATTAC 

25141 AGGCATGAAT CACTGCGCCC AGCCAACCCG CCCAGTCTTG TTAGACATGG GGTCTGTAGT 

25201 TTCTAGTAGG TTCTTGAGTC TAGGGTTCCT ACCTCATGTT TTATAGTTAA TTTAGGGGAG 

25261 GGACTGTGTC TGTTTATCTG GGGATGTAGG GGTGGGCAGG GGGATAGAGG GGACTTCAAT 

25321 TAATGAAACC AGAAGCAAAA CTCAGTTGAG GACACCGGTC ATGAGAGTGG CCTGATTATG 

253 81 GCCAATCTTA CATAATGTGT GAGATCTTGA TATTACCCCA TCCTTGAGAG TCCTCTATAA 

25441 AGCTACAGGG ACTTGGG AG C ACCTTTAATT AC AG AC AAC C CATGTTCCTG TGGATTATGA 

25501 TTTATTAGAT TGCACATGCC TAAATAAAGA CATCCTCTGC AGTCTTTTGA CAATTCTATA 

2 5561 AGCATCTTCT GACTCCGCAA TTAGACAGCT AAGAGATCTG TGTTACTTCC CTCACATATA 

2 5621 TAAATAATTT TAAATAAAAA TCATGGCGTG AATAATTTCT TTCCTCTACC GATTTGAAGC 

25681 TATCCATTTG GAAGAC C ACT CTGAAGAGAT GAAATAAGTC TTCTGCCAAA GATTACTTAT 

2 5741 TAATTTACAA GGAAAAGGGG AAGTTTTGTT CCTCTCCGTG AATTTGATTG AAAATCGAGG 

2 5801 GCTTTCTCGA ATAGTTTTGG CATCCAGGGT CATTTTTCAT TAAAAAGAGA AAAGTCATGT 

25861 CAAATATGAA TTTCCGCAGA TTATTCAGCA CTAGACCCTG GGAGATTCTG TAAAGAGGGG 
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25921 TTTTGTTATA CTCAACTTTT CCGGGTAAAA CAAACACAAA TACTCCTCCT CCAAGGGGCG 

2 5981 GGGGCGGTGC CTAGGTGATG CACCAATCAC AGCGCGCCCT ACCCTATATA AGGCCCCGAG 

26041 GCCGCCCGGG TGTTTCATGC TTTTCGCTGG TTATTACATC TTGCGTTTCT CTGTTGTTAT 

2 6101 GTCTGAAACC GTGCCTGCAG CTTCTGCCAG TGCTGGTCTA GCCGCTATGG AGAAACTTCC 

26161 AACCAAGAAG CGAGGGAGGA AGCCGGCTGG CTTGATAAGT G C AAGTCGC A AAGTGCCGAA 

26221 CCTCTCTGTG TCCAAGTTGA TCACCGAGGC CCTTTCAGTG TCACAGGAAC GAGTAGGTAT 

26281 GTCTTTGGTT GCGCTCAAGA AGGCATTGGC CGCTGCTGGC TACGACGTAG AGAAGAATAA 

26 341 CAGCCGCATC AAACTGTCCC TCAAGAGCTT AGTGAACAAG GGAATCCTGG TGCAAACCAG 

26401 GGGTACTGGT GCTTCCGGTT CCTTTAAGCT TAGTAAGAAG GTGATTCCTA AATCTACCAG 

26461 AAG CAAGGCT AAAAAGTCAG TTTCTGCCAA GACCAAGAAG CTGGTTTTAT CCAGGGACTC 

26 521 CAAGTCACCA AAGACTGCTA AAACCAATAA GAGAGCCAAG AAGC CGAGAG CGACAACTCC 

2 6581 TAAAACTGTT AGGAGCGGGA GAAAGGCTAA AGGAGCCAAG GGTAAGCAAA AG C AGAAGAG 

26641 CCCAGTGAAG GCAAGGGCTT CGAAGTCAAA ATTGACCCAA CATCATGAAG TTAATGTTAG 

26 701 AAAGGCCACA TCTAAGAAGT AAAGAGCTTT CCGGGAGGCC AATTTGGAAA GAACCCAAAG 

2 6 761 GCTCTTTTAA GAGCCACCCA CATTATTTTA AGATGGCGTA ACACTGGAAA CAAGTTTCTG 

2 6 821 TGACAGTTAT CTATAGGTTT AAGTTGTGAT GCAGCTGAGT TGAAAAGGCT TGAGATTGGA 

26 881 GAATTAATTC AGGCCAGGCT TCAAGACCAT CCTGGGCAAC ATAGCCAGAC TACCATCTAT 

26 941 ACCAGGGGTC CTCATTCCCC CGGCCACCGA CCGGTAACCG GTCCCTGTCC ATGGCACGTT 
27001 ATGAATTGAG CCGCACAGCT GAGGGGTGAG CGAACATTAA CCAACTGAGC TCCACCGCCT 
2 7061 GTCAGGTTAG CTGCAGCATT AGATAGATTC TCATAAG CTC AAACTGTATT GTGAATGGCA 
2 7121 CATGCAAGGG ATCTAGGTTT CAGGCTCCTT GTGACAATCT AATGCCTGAT GATCTGAGGT 
2 7181 TGGAGCAGTT TTAGTCCGGA AATCATTGCT CCCAGCCCCT GCACCCCCTG GTCCGTGGTA 
27241 TAATTGTCTT ACACAAAACG GTCTCTTGTG TCAAAAAGGT TGGAGACTAC TGGTTTTACA 
2 73 01 AAAAAGTAAA TTAGTC AAG C ATGGTTGGCA CGCTCCCTTA GTCCCTGCAC CCAGGCGTTT 
2 7361 AAGGATACAG TGAGCTATGA TGGTGCTACC TCACTCCAGC CTGGGTGACA GCGAGTCAGA 
27421 CGTTGTCTCA AAACTTAAAA AAAAAAAAAG TTAAAACAGA AAAAGGGCTT CTTGTCAGAG 
27481 ACTGCCGTAT ATCTAGAGGT CCAGGAACTA AAAAGTCTGA TGTCCAATCC TGAAAAGCTC 
2 7 541 GATGGTGCAC TAGAGGAGGC TTTTACATGT AAGAGCATCT AAGTTCTGGA AATGCCAGTG 
27601 TCAGGGAAGG GAAGTGGAGA GCAATTTGGC ATCCAAACAT AACTTGCTGA TACTTTTTTT 
2 7 661 TTTTTTAACA CAAGTACTAC ATTCTAGTCT TTCTGTGGTG TCATTGTAAC TATTGTTTCT 
2 7721 TAATATGCTA TCCACTGACT TCAAGGGATC AATAAATAGG AATCAAGGTG TCCCAGAATA 
2 7781 TGGATTAGGG GAGTTTTTTT TTTGTTGTTG TTGTTGTTGT TTTCATCTAT TCATTATCCT 
27841 GTAGCTGAAA TTTAGAATTT TCTTCCATTG TGTGTGACTG ATAGAAATAA CAAATTTGTA 
2 7 901 GGTTATAGTT GTTGCAAGAA TCTGGAAATC GTGCTTGCTT ATTTCCGAAG TACTATTAGG 

27 961 TATATCAACA AAAACACACA TATTACGGTC AAGTGGTTTG ATAATTATTT TAATATTATT 

28 021 GGTCTAATAC AATTGTAACC CTATGAATTA CTTTAAGTAT CTTATTTATG AAAAGAATCT 
2 8 081 GTAAGTTTCA TCAAACTACC AG AGCATAC C GAAGACTGAA AAATTTTAAG AATCCAAACC 
2 8141 TTAATGGAAA TGTTGGAGGC TGCCCAATTA GGTTCTGAAT TCCACCTTCC TGAATCACAA 
28201 ACTTGTTTTA ACTCTCAGTC TGAGGTAAAC TACGTTTCTC TTTAAACAGA CATAGTTTAA 
2 8261 TTTTCCTTTG ATTTTTGATT TAGTATTCTT ACTGAT CATC ATAAATAACC AATG CTAATG 
2 8321 TTAGTCTACT TTGGACCATG GTATTTCGAG AAACTTTGAA CAAAGTCCCC TGCAAAACTA 
2 8381 TG CATTG CAT TATTTCACAT ACATTTATGT TTTCCAGACG GTTCAATAGT ACCTCACTTT 
28441 TCTGAACTTA TTTGT AT AG T TTGGCATCTT TTTAAAAATT GTGTCCTATA ATGAAAGGTT 
28501 GTAAACATTA TGTTTTAAAT TTGTATAGAT AAAATCAACC ACAGACCTTT CCTTGCTTGG 
2 8561 ATGTAATTGC CATTGTTTCC CAATGAGTTC GGAATTACTA GGATTGTG C A AAAATATGCC 
28621 TCACTTGCCT GACATAGCAG AGAGCCATTT TGCCTAAATG CTGTGCCCAG CAATGGACTG 
28681 TCACCAGATT CTCATCACAT ACAGTGAGGA TGAACAACTA GCCTCTCCCA GCAGCTGGCC 
28741 GGTCTCTCAA TAATATGGGA CTCCCTCAAG ATGGCTTCCT GCACCTTTGC TCCTCTAGCC 
2 8801 TTGTATGTAT ACAAGGCTAG CATGCCTGGC ATACATAAGG TTAAAAACAA AATCAATAAG 
28861 TTATGGTTCT TCCTCCAGTT CTGGGGATTA TTAGACCACT TTTTTGTTTT GTTTTGTTTT 
28921 GGATGGAGCC TCGCTCTGTC ACCCAGGCTA GAGTGCAGTG GCACAATCTC GGTTCACTGC 
28 981 AACCTCTGCC TCCTGGGTTC AAGCAGTTCT CTGGCTCAGC CTCCCACGTA GCTGGGATTA 
2 9041 CAGGTGCCCG CCACCACGCC CAGCTAATTT TTGTATTTTT AGTAGACGGG GTTTCACCAT 
2 9101 CTTGGCCAGG CTGGTCTTGA ACGCCAGACC TCGTGATCCA CCCACCTTGG CCTACCAAAC 
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2 9161 TGCTGGGAAT ACAGGCGTGA GCCACCGCGC CCGGACTTAG ACCACTTTGT TTTGGCCAAT 

2 9221 AGGACAACAG C C AT AGAACC CTCCGCAAAT GAGAGCTTGT CCCTAAAGAT GCTTTATTTA 

2 9281 CATAGCTGTG TGCCGCATGA GCCAAAAGGT GATAACCTTT GTTCAACACG CGCCTCCAGC 

2 9341 CCTTCGGTTA AGTCCAAAGT ACCATTCTTA GAATGCTCTA AAATACATAA TTTTTTTTTT 

2 9401 TTTTTTTTTT TTTTTGAGGA GTCTCTCTCT GTCTCCCAGG CTGGAGGGGA GTGGCGCGAT 

2 9461 CTCGGCTCAC TGCAATCTCT GCTTCCGGGC TAGCTGGGCC TACAGGTGCA GACCACCACG 

2 9521 CCCGGCTAAG TTTTGTATTT TTTTTGGTAG AGGGGGTTTC ACCATTTTGG CCAGGCTGGT 

2 9581 CTCGGATTCT TGATCTCAAG TGATACACTA GCTTTGGCCT CCCAAAGTGC TGGGATTACA 

2 9641 GTCGTGAGCC ACTGCGCCCA GCAAAATGCT TTTTGTGGAG CCAATCACTT TATTAGCGCT 

2 9701 TACCTCTCTA TGCCTACTTT ATGCTTTGAA ATTTTGTCAC AGTGGGGCCG GTCATGGCAA 

2 9761 ACACAATTCA TTCTTATGCA GGCTGTCACG GTTATTTCTG TCATCCAAAC TCATTCTCGC 

2 9821 AACGCATTTC AGCTCTTTAA ACGACTTTGT GAGCGGCCCT GAAAAGGGCC TTTGGGTTTT 

2 9881 TTTGTTTTTG TTTTTTGAAG TTCTCAGGAG ACCGCGTATT CTTAGATTCA GCCGCCGAAG 

2 9941 C CAT AC AG AG TGCGCCCCTG ACGTTTCAGG GCATATACTA CATCCATGGC TGTGACAGTT 

3 0001 TTGCGCTTGG CGTGCTCCGT ATAGGTGACG GCGTCTCGAA TAACGTTCTC TAAGAAAACC 
3 0061 TTAAGCACAC CTCGAGTCTC CTCATAGATA AGACCGGAAA TGCGCTTGAC GCCACCGCGC 
3 0121 CGAGCCAAAC GGCGGATAGC CGGTTTTGTA ATGCCCTGGA TGTTATCCCG GAGCACCTTA 
3 0181 CGATGGCGCT TAGCACCACC CTTCCCCAAG CCTTTTCCGC CTTTGCCGCG ACCAGACATG 
3 0241 ATTCCTATCG CAGTGGAAGG TATGAACTGA AACAGTTCCT T AAATACATAA CTTGGCGGAC 
3 0301 CTGATTGAAA ACAACATGAG TTGGCGCGGT TTTTTTTTTT TTTCAAATTT GGTCACCGAG 
3 0361 TGGGTGGAGC AAGAAAAACT GTTTCATTAT GGTTCATTGT TTTGATTGGC CAGTGACAGC 
3 0421 TTGCTCTTTG TGGGAGTGGA AGGGTGTTTG CAAGTTGAAT GCGCTGTATT CCTGTCAGCT 
3 0481 TAATGACGCT AAGCATAGCC CCATTCCACA TTTCTTTTTA TTTCCACTTG CTAACTAATA 
3 0 541 AATTACGGAA TAGTTTATTG GGGAACATAC AAATAATGTT TAAAGGAGGT CAGATTTATA 
3 0601 GGTCAAGGGA TTTACCCTCC CAATCATTTT AATATTTTTA TTTAAACCAG G C ATTTTG AT 
3 0661 GGCCTTCTCT GTGCTGGACA AGGTATAAGT TTGGCTATGA AGTTTCACTC CTAAAGACCC 
3 0721 TATGTTTTGG GAAGGCAAAA AGGTAGCCAA ATAATTGCAA ATTAAAACCT CATAAGTGCA 
3 0781 AACTTCTTCC TCGTCACTTT CCCTATCTCG ATTCAAATAT TTGTTGAATG ACTCATTTTT 
3 0841 CTGCAAAAGT CTGAGAGAGA CAGGGAATAT AAACTTAAGT CTGGATAATA TGTTTTCCCG 
3 0901 GGACGCTCTT CCTGGTCTGC TGTGCCTGTT TGCTGTGCCT GAAATTCCAA ACACTCTTCC 
3 0961 CTTCCCTCCG TTTTTAATCC CCTTTCAACT TGCTACAGCT TTAGAGAAAA GAACATACGT 
31021 TTTGTACAGT TGGGGATTAA TTGAAGTGTA GGGCTAATAC TTGATTAAGG TCATTACAAA 
31081 ATCTACAGGG TCTTCCTCTG GGAGGTTTTT GTGATAAGAT TATTGGTGTT AAAATAAGGC 
31141 TAATCCCCTT GAAAAATAAA TAGAATAGCA GAATTGGGTC TGAATGTGGT TTGAAGAAAG 
31201 GGACTTCTCA ATTCAAAATT TTATTCTTAG CTTCCTGTGG GAGCTTTCCA GAATGCCCAT 
31261 AAGATC CACT TTTGTTTAAA AAACAAAAAC AACCCCACCC ACCACTCTCT GGTTAATAAA 
31321 TGAATTTCTA TTGGGAATAT TTAGAATGGG GCTGTGGCCT GTGAGAGACA TTATATAGTA 
313 81 ACCTCAGACT TGCTCACATG AAGAGAAGAA ATCCAGGAAT GGAGAAAAAA GACCCAGGAA 
31441 AGGCCAGAAT GCTCTACATG TCATATTGTT TGTATCACTT CTGAAATAAT TGATTACATT 
31501 CTTCTGCCCC AAATTGAGTT CTTAGGTTCT TCCACTCACT GTCCACATGC CACAACACAG 
31561 ACCTTATAAC TAGAGACTTA GCTAGGAAGA AATGTCAAAC ATTACAGAGA AAAAATGCAG 
31621 AGTCTGAGAT CATAAGTAAA ACTCTGAAAT CTCAACATGC CTTTTAATTC ATGAAAATAA 
31681 AAAATATAGC AGCATATGCA ATATGATAAT TCTCTGAAAA CATACATCAT GTGAACTACC 
31741 CTGGAACACA TCTCGCCAAG TGCCATCTTC ATTTTAACCA GAGGTCTAGG ATGCCTTTCC 
31801 TTTATTTTGC CTATTATATC ATTTATAAAA CCCCATTTTT ATTTTGATAT TTTATTTACT 
31861 TTCTATTTCC TGCTCCTAAT ATCTCCTTTC TAAACTTTTC TCAATGACAG TGACTCAAAA 
31921 ACAATGAATG TCAGAACAAA TATTTAAAGG ATCTGTACAT GTAGATATAT ATATTTAAAA 
31981 TGGATTCTTC CACTCTGGGA AGAATTCAGG CATACTCAAT CTTATGGTTA GGGAGAGATT 
32041 AGGCTCACTC GCCTAATCTG TATGGCTTCT CGTTCGCTTT CCATTTCACC TTCCTCTCAC 
32101 CCATCAGATC AAACTCATTC ATTGAACAAG AGACCTAAG C CCTTCAGATT AAAACTCTGC 
32161 AAACAAGTTG TGGTTGAGAG GATACATGAA GCATTCAAAC AAATAAATCT ATGATATTAA 
32221 TCAGAGGTTA ATCTATGATA TTAATCAGAG GTTAATGCAG TGGCTCACGG CTGTAATCCC 
32281 AGCACTTCAG GAGGCTGAGT TGGGAGAATC GCTTGAGCTC AGGAGTTCAA GACCATTTTG 
32341 GGCAACATAG CAAGTCTTCA TCTCTACTTA AAAAAAAATA AC CAG AGGTG TTATGAAAAT 
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32401 ATAAATTGTC CAGAACTACC CTCCACAAAC TAACTCTCTC AGAATATTCG ATATGAGGAA 

32461 TGAAATATGG TGTGTGTGTG TGTGTGTGTG TATGTGTGTG TGTGTGTGTG TGTATGCACC 

32521 TATATATGGC ACCTATATAT TCAACAAACA ATTCTGATAA TTGGCCAGGG TTGAGAATGA 

32581 CTAGCAGCCC AGCATACACT ATCAGTTTTA AGTATATAAT TGCGCTTTAG TAAAATGTAA 

32 641 AGAAATCCCA GAGTAGAAAT ACTTTTAAGC TATATTACAG GTGAGAAAAT G C AT AAGTAT 

32701 AGTCTCACCC AACTTAGACT ATGGGGGCTT TATAATGTCA CAACAGTTGT TTCCAGGCAT 

32 761 TTGGGGACAT CACCACTGGT CTTGGGCAAG AAACTCCTCT AGCCAATGGC TGATTTATCT 

32821 CACTCCCATC TAAGGCTTCA CTGCATTTCT CTTTTTCAGC AACCTAACTT ATTTAAAAAT 

328 81 ATCCATTTTC TGATTCATTT TTTTCTGAAT TAAACTGTCA GTACCATTGG CACACCTTTG 

32 941 GTTCCGTAGC ATACCTGTGT CTCTGCTGTG GTTTTTTTTA CCTCCACTCC TTACTTTTCT 

33 001 AGAAAAAAAT CTCTGCTTTT TCTTTTCAGT TTAAATTATT TCACAAAAAG TTTTCTTGAC 
33 061 TTGCACTTCC TAGGCTTGCT GTCCTTGTGT GGGCACGCTC CCATAAACAC TATTAATACA 
33121 CTTCGATTTG TTAAAAATAA AGATATCTGG ACAGAAAATT TCTTTTCTTT TTTTAAGATT 
33181 TTAAAATTTT TAATGTTTAT TTTTTTCCTA GACTGGAGTA CAGTGGCACC ATGATGGCTC 
33241 ATGGTAGCCT ACACTTCCCC GGGCTCAAGT GATCCTCCCA CCTCAGCCTC CCAAGTAGCT 
33 3 01 GGGACTACAG GTGTGCACAA CCACACCTGA CTAATTTTGT TTATTTGTTT GTTTTGTTTT 
33361 TTGAGATGGA GTTTCGCTCT TGTTGCCCAG GCTGGAGTGC AATGGCGGGA TCTCGGCTCA 
33421 CCGCAACCTC TACCTCCCAG GTTCAAGCAA TTCTCCTGCC TCAGCCTCCC GAGTAGCTGG 
334 81 GATTACAGGC ATGCATCACC ACGCCCAGCT AATTTTGTAT TTTTAGTAGA GACGGGGTTT 
33 541 CTCCATGTTG AGGCTGGTCT GGAACTCCTG ACCTCAGGTG ATCTGCCCGC CTCGGCCTCC 
33601 CAAAGTGCTG GGATTACAGG CGTGAGCCAC CACGCTCGGC CACTAATTTT GTATATTTTG 
33 661 TAGAGATGGG CTTTCCCTGT GTTGTCCAGG CTGGTCTTGA ATTCCTGGGC TTAAGTGATC 
33 721 TGCCCACCTT GTCCTCCCAA AATG CTAGGA TTACTGGCGT GAGCCACCAG GTCTGGCTGG 
33 781 AAAGATAATT TCTAACATTA TCCTCTCTTA AACATTTGTT TCAAAAATTT TACAAACATG 
33 841 AGAGTAATTA AATTTGATTT TCAAAATTCC CTTGAATACT TTCTTAATAG CACACAGAAA 

33 901 GCACAAAGTA TTTTACATTT GTTTTAATGA TGAAATTGTG AACCCAAACT TACACAAAGA 
33961 AAAACCGTAA CATTATACCC ATACTTAAAA CAGATGCCCT CATATACATA GTAAAACTCT 

34 021 TGGGGGCAGT AGTGAAGTTG GTTATTTACT GTTTTATGAA AGTGCCATTC AGCCGGGTGC 
34 081 AGTGGCTCAT GACTGTAATC CCAGCACTTT GGGAGGTCGA GGCAGGCTGA TCACGAGGTC 
34141 AGGAGTTCAA GACCAGCCTG ACCAAAATGA TGAAACCCTG TCTCTACTAA AAATACAAAC 
34201 ATTAGCTGGG CGTGGTGGTG TGTGCCTGTA GTCCCAGCTA CTCAGGAGGC TGGGGCAGGA 
34261 GAATCGCTTG AACCTGGGAG GCGGAGATTG CAGTGAGCCG AGATCGCACC ACCGCACTCC 
34 321 AGCCTGGGAG ACAGGGCGAG CTCCGTCTCG AAAAAAAAAA ACAAAAAAGT GCCGTCATAG 
34381 TGACTTAGTT TTAAGGAATA AATCAAGGAT ATTTAACTCA ATAGACTACA GTTAGCTAAC 
34441 GTGACTTGCA CTGAAAGTTA TACGAATATT GGTACTTATT CCCCTGCCCC TGAAGTATGA 
34501 ATTAAAGACT CCAAAATTCT TTTTAGAATC TTCAGAGTAA AAGCTAGAAT TTGATTTTTT 
3 4 561 TAAATAATAA AAAAATACTT TGTATCTAAA TCTGGTGTAT AAAATAACTT GGTGGATGAT 
34621 GCTTCAAGGC TATCCATCCC CAAATTTCTC CCTGAATGAT AAAGAGAATA AATGAATATG 
34 6 81 TCAATTCAAA AGTTAGAAAT TTGGCCGGGC ACGGTGGCTC ACTCCTGATA ATCCTTTCGG 
34 741 ACGCTGAGGT GGGTGGATCG CATGAGCTCC GGAGTTCAAG ACCAACCTGG G C AAC AT AG C 
34 801 CAGAACCCGT TTCAATAAAT AATAGAAAAA AATGAG CCAG GCGTGGTGGT CCCAGCTACT 
34861 CAGTAGG CTG AGGTGGGAGG ATCACTTGAG CTCAGGAGGT CGAQACTGCA GTGAGCCGTG 
34921 ATCGCAGTAC TGCACACCAG CCTTGGTGTC AGACTGAGAC CCTGTCTCAA CAACAACAAA 
34 981 ACAAGTTAGA AATTTGGCTG GGCGCGGTAG CTCACGCCTG TAATC CCAGC ACTTTGGGAG 
35041 GCCAAAAAGG GCGGATCATT TGAGGTCAGG AGTTCGAGAC CAGCCTGGCC AACATGGTGA 
3 5101 AACTCCATCT CTACTAAAAA TACAAAAAAA CTTAGCCGTG CATGGTGGCA TGCGCCTGTA 
35161 GTCTCAGCCA CTTGGGAGGC TGAGGCAGGA AAATTGCTTG AACCCAGGAG GCAGAGGTTG 
35221 CAGTGAGCCG AGATCATGCC ACTGCATTCC AGCCTGGGTG ATAGAGTGAG ACTCCATCTC 
35281 GAGAAAAAAA AAAAAATTCT GTATGAACTG AACAAAATAT CCTTAAATTT TAAAATACAT 
35341 CTGAAAGATA TTTCAAAATA TTTAGGAAAA AAATTATAGG GATCAGGCAA ATTCTGAGAT 
354 01 TCCTTTTTCC CTGCAGCAAA CATTAGGAGT GCTGCTGTTC CTAAAAACAT GGTAACTGTT 
35461 GCCACACCGT ATGTTTCCTT GGCTCAGACA TAAGGTTGTG TAGTTGTTAT TCCAGAATAG 
35521 CTAGAATAAA AATCCAGCAC ATCATTTTCT TCAGCAAGTT AACTAACCTC TCTGTGCCTT 
35581 GGTTTCATAA CAGCAACATA AGCATAACAG AATAGCAGCA ATAGCTCCTA CCTACCTCAT 
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CATGGGGTAC 
TTCAATC CAT 
TGTACAAATT 
TATCTATCTC 
TGCATGCATA 
AAATTGTCAT 
TTAATCGCTG 
ACAATTACAA 
ATGATAGTCA 
ATGAAAAATT 
AGAATGTTTT 
AATTTTGGCC 
TCAGAGTGGT 
TGATTCCAGC 
GGGCTTTTCT 
CCTTCAGGAA 
TATGGCCACC 
GGTGAATAAG 
TTATGAATAA 
ATTTCTATTA 
ATGTGTTACT 
ATATAAAACA 
GTAAGTGAAG 
TTGTCATATG 
ACAATGTCCC 
TCCGTAAGAC 
GATTAAGTAT 
ATGTACATTT 
CAAGTTGTTA 
TAGCTTTTCT 
CAAAAAAAGA 
AAGAAGAAAC 
CTTGTACCAC 
GGATATATAT 
GCACTTTGGG 
CCAACATGGT 
ACGCCTGTAA 
CAGAGGTTAC 
AAAAAAAAAA 
TGAATGTCCT 
GGACCTGGGC 
GCTTACTCCA 
AGGGAGGACA 
CTTGCGTAGG 
ATTCCAGGAC 
TGTTTTCTGC 
AGAGAAGAGC 
TGGTGTCTGA 
GACAAACTAT 
TTTTATTTTT 
CTCGGCTCAC 
AGTAGCTGGG 
AGATGGGGTT 



AGTAAGTAAT 

AAGTACAATT 

CCCGGAAAAA 

CATCCATCCA 

TTCTG C AC CT 

TCAAGTAATA 

ACAAAATACT 

ATGCAAAAAA 

ATATCAG C CT 

TTGTTAAATC 

TGGGGATTTT 

ATAAAATACT 

TGTACCTCTC 

TTGTTTTCCT 

AATTCAAATA 

CTACAAGTGA 

AACCCTCAGC 

AGGTGTCACC 

AATTTAAAAT 

AG CCAGAAAA 

AAAAAATTAA 

GTGAAGACTG 

AATG CTTACC 

CCTTATAGCA 

CTTTCAGAAT 

TAGAGTGAAG 

AAAATGTAAA 

ATACACAAGA 

ATTAAAGTAT 

ATAGTTTTTC 

ACAATGAGCG 

AAATGGTTTA 

TTCTGAGCTG 

ATTTATGCAG 

ATTAAGACTC 

AGGCCGAGGC 

GAAACCCTGT 

TCCCAGCTAC 

AGTGAGCCGA 

AAAGACTCTT 

GGTTAGATTT 

ATCCTCTAAG 

CTTCATTGAG 

CCGTGTGTGA 

GTCCAAAGAA 

CTTCCTTCAG 

TGTTCCCCTT 

CATCTCCTTC 

GATGTATCAA 

GAGATACTCT 

TTGAGATGGA 

TGCAAGCTCT 

ACTACAGGCG 

TCACCATGTT 
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AGCCAGGATG 

TGGGATTACA 

AATGGGGATA 

GTAAGTAGGT 

TTTTCATTAT 

AATACTAGAG 

TTCATGCCAA 

TGCTAAAAAC 

CCTCTTCCAT 

GGAAGAATCC 

AAGGAAAAGT 

TATGACTAGC 

GTCCCCCAAA 

TCTCTGAGGT 

ACAGTTCCAG 

AATGGGGAAA 

CAATTTCTAT 

CTCTTTGAGT 

ACGTTCAGCT 

GAGGCATCAC 

CCCATCTCAA 

AATG CTTATT 

TC AC CAT ATA 

ATCATAACCC 

CTCTTTTCCC 

TAAG CAAACT 

CTCCTACTTG 

ATGGAATCCA 

TCATTCTTTC 

TCTTCTTTAA 

AATAGCCACA 

AATAGTAGGT 

TATTTTCTGT 

TCCCATGGAT 

AAGAGACTTC 

AAAATTCCAA 

AGGGTG C C AG 

CCCTCAGCCC 

AGGCACTCCT 

CCCAGATCAC 

GCTCTTCCCC 

GCAATACGTC 

TCATGGTGAT 

AGGTCACATG 

GGAGAGGGGA 

TGGAGGAAGG 

TCCTAAG ATT 

TCACATGTAA 

TTTATGTGCA 

GGCATTGTTA 

TATGAAAGAA 

TTTTGTCCTG 

TTTTTAAGAT 

CAGTGCAACT 



GTCTCGATCT 

GGCGTGAGCC 

ATAGTACCTA 

GCTCAGAAGA 

CAGAACAAGG 

TAGCATCCCA 

TTAGAAAAAA 

ACCCATCATA 

TCGTGCAGTG 

CCAAGGCTTG 

TGAACGGGTC 

CACGTCCCAG 

TTTAAGGAGT 

GACGGAGGAA 

GGGAGAGGTC 

TCTTTTTGAG 

GTTTAGGTTC 

CTCTAGTTTT 

AAGACGTAGT 

AAACCTAGGC 

TTTAGACCTG 

GGCTTTCTAA 

AGGGAGATCG 

AATATCC CAA 

TACACCACAG 

CAGGGTTAAG 

GCATGGTTGC 

GCTTCTCCTT 

CTTTGACACC 

CTACATTTAC 

GTGACTTCTC 

GCTCTGAAGA 

CTCCCAGGGA 

GCCAGATCCC 

CCCCTTGTTC 

TGAACAAGAT 

ACGGTGAGGG 

ACCCCCTAAC 

CTCAACCCCC 

AATGAGGGGC 

AGGGGGTACA 

TTTAGGTTCG 

GTTCTGGGGG 

ATGTGTCACC 

TCTGTTTACC 

AAATACCCTT 

GGACTCTAAC 

ATATACATAT 

TTGAAAATGA 

TAAGAAG CGG 

ACTTTTAACC 

GTATTCATAT 

GGAGTCTCAC 

TCCGCTTCCC 



CCTGACCTCG 

ACCGTGCCCG 

TCTCATAGAA 

GTCGGACACG 

AGAGACCAGG 

AATGAAGGCA 

CACCTCTTCA 

CTACCCACAG 

TACAGCCTTC 

GTGACAGATG 

CAGAAAATGC 

GGTTCAAAGC 

CCTCTTCCAA 

ATGAAGGAAG 

ACAGCTAGGG 

GAAATGAACA 

AACTCTCTCC 

GTCTCCTTCC 

GCCCCATGGC 

ACCATCTTGC 

GGCACTATTG 

CTGGTCTCCT 

TGGTCCTCCT 

AAGACCCTTG 

ATGTTCAGGG 

GTACAGTAAT 

TCCGTCTGTG 

CAAGATCCAG 

CTAAGCATTT 

TTGCTATCAA 

AACCTCAAAG 

TGTTTGTTGA 

GCTGCTGGTG 

CTCTGCCCCT 

CTACTCACTT 

GACGACAAAA 

CTCTAAAACA 

AAAGAGCAGA 

AATAGATTTT 

TGATCCAGGC 

GCCAAGGTTA 

AACTCCTTGG 

TAGTAGTTCA 

TTCACCAAAG 

CTTGCCAGGA 

TTCAGAAAAA 

ACAGTGTCAC 

CTGTTACCCA 

TTGAATACAG 

ACTTGTAAGA 

TAAAGGATTC 

GATAGCTGAG 

TCTGCTGCCT 

AGGTTCAAGC 



TGATCCACCC 

GCCTACTTCA 

TTATTGTAAG 

AAGTAAGTGC 

TAGAAAATTA 

CCATTAAACT 

CAACCCCTTT 

AT AG C C ATG A 

ATAGCTGTGC 

AGTTACTGGG 

ATAGATACAT 

TTTTCTCAGA 

AAATAGGAAA 

CCTCTAGATG 

ATCACCGGCA 

GAGAAGGCTA 

TGAAACATGA 

CACAGTGAGT 

TCCTCCTGTG 

CTCTTCTCTC 

GATTTCAAGA 

CACCTCTCAT 

TTCTTAGGAT 

GACTCTGTAT 

GGTAGAAATG 

TATTTCTAAT 

TAGACCTCCC 

AAGGCTATCT 

GCTTCCTGCC 

TTTCATTCCC 

CCCCTGTACT 

ATTAGAGACT 

TCCCCAAAGA 

CTTCCCACTG 

GAACCCTGCC 

AC AG C AATTC 

GAAAAAGCAA 

TCCTCATCTC 

CTCAGCTCCT 

CTGGGTGCTC 

TCCAGCCCTG 

CATC C ATTGG 

AGGCCCGACA 

GAGGCACTTG 

AGACTGGAAC 

AACAAGCTAC 

TTGGAG AG C A 

TGTTCTTTGT 

ATGGTCAGTT 

TAGGTAGCTT 

TTCTACTCTG 

ATCTCTGAAT 

AGGCTGGAGT 

GATGCTCCTG 



GCTTTGGCCT 

CTTTCTTCAT 

AAGTGCATGC 

TTTTATCATC 

TTGTGATTCT 

TTGCAAATCT 

CAAGATATTT 

TGCTTTTTCT 

AACTCACATC 

TAACACAGAG 

GTGTAAAAAT 

TGTTAAAATG 

TGAAATGACA 

CAGCTTGAGG 

TGCAGGAACT 

AAATCAAGGA 

AGAGCTCATA 

CTGCAGGCTG 

GAGACAAGAG 

TTCCTTATTT 

ACCATTATCT 

CTAACTTCTT 

CCTTCAATGA 

GAGCTGGCTT 

CATAATTGGT 

CTCCCAGTAT 

ATCATCTTCA 

TGATCCCCAG 

TGCTTTAGGA 

TACCAGATTT 

ACCTTAAACA 

TTCATTCTGG 

ATATAAATGA 

TGCCCTGGGG 

TCTTCCTTAA 

CACTGATGAC 

GTTAAAGCCT 

ACTGCCATAA 

GGCTCTCATC 

CACCTGGTAC 

GTAGGTCCCA 

CTGCTTATCC 

CCGTAGAGTG 

ACAGGAAAGA 

TTTCACTTCC 

AGGAGAGACA 

GTCAGATCAG 

TCTGATAGAT 

TCACCTGGGT 

CAGTGATTAT 

ATAAGTGGCC 

TCTCTTTTTT 

GCAGTGGCGC 

CCTCAGCCTT 



CCCAAAGTGC 

TTAAAAAAGA 

AGTAATGCAT 

CTTATCATAA 

TCAGGTCTGG 

GTATGACACC 

GCCTCCTACC 

GGGACAGGTG 

ACAATCAGAT 

AGAGGATTCA 

CTGGTAAGGT 

AATCATGTAA 

TAGGTGTATG 

TTCATGAGAG 

CAGAAACCTA 

GTTCGTCAGG 

AATGCACTCC 

CGTGTCACTC 

ACCCAGGAAA 

TCCTCATTCA 

CTCATCTGGA 

AACAACACAT 

CACCCCAGTG 

CTTTCTGATT 

GAGTGATAGC 

GCCTTATACT 

ACCTCACCTA 

CTGAATGTGA 

CCTCATGGGG 

GGGTTCTGAG 

GCTCTTGCAA 

GGAGAACCAT 

GAAAAATGCT 

CAGAGGTACT 

TATTATGAAC 

TCCAATGACT 

TTGATTGCCA 

TTACCTCCTC 

AGTCACATAC 

GTATATCTCT 

TCCCCATTGG 

TTCAGCCACT 

GTCACTGAAG 

GGAAGGATGA 

TTCTATAGGT 

CCATTTTGTG 

CTTGTTCTCC 

AAAATTGCCC 

CAACCTAGGA 

TGCTATGTTC 

TCACTTGATA 

GATCTTGGCT 
CCAATTAGCT 
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GGGACTACAG 
TCACCATGTT 
CCAAAGTGCT 
TAACAGGTAT 
TCCCTTTGAG 
ACATCTCAAT 
AGGCACACAG 
CTCCACTCTG 
AAAACACCTC 
TAGGCCCTGT 
GCCCTGGGTT 
CCATCATACC 
AGGATGACCT 
AGGAATAGGT 
TTCCCTCTTC 
AAAAGATGAA 
TGTGGTTGTG 
TCAGACTCTG 
TTCGGGGCTC 
AGCCCAAAGC 
AGTGCAGAGA 
GGAGCAGGAT 
CCTCATTTTG 
CTCTTTCCTT 
TCCTATTCCA 
TTAAGGTGTG 
CCAAATCCTG 
GAGACAGAGT 
CAACCTGCAC 
TATAGGCGTG 
CCATGTTGGC 
CAAAGTGCTG 
CTGAACTGCC 
TCCAAATGCA 
AAATGGCATT 
CAACTTTAAC 
AGG CTGAAGT 
AATACCCTCC 
GCTAATTTTT 
ACTCCTGAGC 
AGCCAATTCT 
TATTCAGGAG 
CTCTTGCCTG 
TTCATTCTGC 
GTTACTCCTT 
GCTCCTCTTC 
TTTTTTTCAA 
TTAGAGACAT 
GGACAGGATA 
CGCCCCCCCC 
ACTTGCCAGG 
TTCCTTCATG 
CTCACAACAC 
CTCTCCCACT 



GTGCGCATGA 
GGTCAGGCTG 
GGGATTACAG 
AAATATACAA 
CCATATGCAT 
TATAAGGTAG 
CCTCAGCCAC 
CCACTAGAGT 
TCCCCAGCTC 
TCTGCCTGGC 
CTGCTGCTCT 
CGTACTTCCA 
GCAGGGTGTG 
CCCTATTTCC 
CCTGCTCCCA 
AAGCTCTGAC 
ATTTTCCATA 
ACTCAGCTGC 
CACACGGCGA 
TTCAAACAAG 
GTGTGAACCT 
GTTGAGGCTC 
TGAAGGGTGA 
GTGTGCTTTT 
ATACTCATGA 
TCTAGCCATG 
AGGAATAATT 
CTCACTCTAT 
CTCCTGGGTT 
CGCCACCACA 
CAAGCTTGTC 
GGATTACAGA 
TATGTGGCCT 
GATCCTTGAT 
GCCAATTACC 
TCTTCTCTTT 
ACAGTGG C AC 
ACCTCAGCCT 
GTATTTTTTG 
TCAGGAATCT 
TCTCTTTCTC 
ACAATGGTTT 
GACTGTGTAA 
TTTCCACATA 
GGCTAGAATT 
TGAGCCCATT 
CCCGAGCTTC 
TTTTGGTTGT 
CTGCTAGACA 
CCCACACACA 
CCCACTCAGT 
TCTCTGCTCA 
CCATGATTTC 
AGAATGCAAA 



CTGTGACCAG 
GTCTCAAACT 
GGGTGAGCCA 
AAGATTATTG 
GGAGAAAAGA 
AGACTCTAGG 
CTCTGAAACT 
ATAGGGGCAG 
CAGCAACTGC 
CCGAATCTTG 
CCAATCCAGT 
GTAGCCCTCG 
GGACTCTGGA 
ACCATCCCCA 
CAAGACCTCA 
AACCTCAGGA 
ATAGTCCAGA 
AGCCACATCT 
CTCTCATGAT 
GAAAGACCAA 
GGAGACAGAG 
CACACACCTG 
GTTGCAGTCC 
CTCTGCCACA 
TTAGACAGAC 
GTAGTTGAAC 
CCTTCAGTTT 
CACCCAGGCT 
CAAGGGATTC 
CCAGGCTAAT 
TCAAACTCCT 
AGTGAGCCAC 
CACCACTTGG 
TTACCCCAAA 
CCACTGCTCA 
TTCAGGGGGT 
AGTCATGGCT 
CCCGAGTAGC 
TAGAGAAGGG 
GCTCTCCTTG 
TCACACAACA 
GTCACTCCCT 
CAGCTTCCTG 
GCAGCCAGAG 
CACACCACAG 
ACCTACTTCT 
TTAAC CAGG A 
CAAGACTGGG 
TCCTACATGC 
CACACATGAG 
TTGCCTGGGA 
AGTGTCAGCC 
CTGATGTTGT 
ATATCAAAGG 



CTAATTTTTG 
CCTGACCTTG 
CCGTGCCCGG 
GTTAAATAAA 
AATTAAACCC 
ATTGAGAAAG 
CCAACCAGGG 
AAGTGTGTTT 
TGCAGCTGTG 
TGCCTTTCCC 
GTGTCAGGGC 
GTACTGTTGT 
AAAATCCCCA 
AGGACCAAAT 
GACTTCCAGC 
AGGTGAGGCC 
AGTCAACAGT 
GGCTTGAAAT 
CATAGAACAC 
GGTCCTGCTC 
CAACAGGCCT 
CATCAACTCA 
TGTCTTTCTT 
CGTGGCTGCC 
TCCACTAAAG 
TCAGGAGTTG 
* >j< ^T'T^T* 
GGAGTGCAGT 
TCCTACCTAA 
TTTTGTATTT 
G AC CTCAAAT 
CGTGCCCAGC 
AAGCCTGACT 
CTGCTCTTTC 
GGCCAATAAA 
CAGGGGAGAC 
CACTGCAGCC 
TAGGATCACA 
GTTTTGCTGT 
GCCTCCTCCT 
TAGAATC CTT 
TTTCTGTTCC 
GCTGGGCTCC 
CAATCTTTTA 
CCTACAGGCG 
TGGCCTCTAC 
GTTTGTCTAC 
GGAGTGCTCC 
AGATGGTAGT 
TAGTGCTGAG 
AATACTGCTC 
CCAGAGTGAC 
ATATCTTTCT 
GTAAAGACTT 



TATTTTTTTA 
TGACCACCCG 
CCTTGACATT 
AAGCAAGGGC 
ATGACTTGTG 
TCCCTTCCCA 
ATTCCGTGCC 
CCACCATACC 
CAGGGCAGTC 
ACTCCAGCTT 
AGAATTCAAG 
CTTCTTGCAT 
GCCTTGTTAA 
GATCTCAGGA 
TGTTTCCTTC 
CCCTCTCCAC 
GAACATGTGA 
TCTACTGGAA 
GAACAGCTGG 
TGAGG CACCC 
TAACCATGTG 
TAC CATCAGC 
CCATATGACA 
ACCCCCTCAC 
CTGGTGGATT 
GTGCTCAGGG 
TTT TT T TT TT? 
GGCACAATCT 
GCCTCCTGAA 
TTAGTAGACA 
GATCTACCTG 
CTTGGTCCTG 
GGAATCTCAA 
CTCTGCCTTC 
ATTAAAATAA 
AGGGTCTTGC 
TCAACTTCCT 
GGTGCATGCC 
GTTGCCCAGG 
TGGCATGAGC 
CAGCAACTTC 
CACCCAGCCC 
CTGCTTTTAC 
AAAGCCTGTG 
CCTGCACAAC 
TCCCCAGCAC 
TAGGTGACAT 
TAGCACCTAG 
CCCCCTTCCC 
AAAACCCGCT 
CCAGTCAATA 
TTGCCCTGAC 
GCTCATTTGC 
GTTTCCCTGC 



GAGACGGGTT 
CCTCGGCCTC 
TCTGAATTTT 
CATAGACACT 
GCTGTCTCAT 
GAATTTGGAG 
CTGCAACCTC 
TTGTTGGTCC 
CCTCTCCAGG 
GGTGGGCCAG 
GTGGTCCTGC 
TTCACAGCCC 
CTGCAACCAA 
AGCAAATTCC 
AAGATGCATG 
ATACCCTTGC 
TCCCACCCTT 
ACCCATGGAG 
TCATCCACGT 
ATGAAGAGGT 
TAGTAGGAGG 
TGTGTCTGGT 
GTCCTGGGTG 
TGCCCCCAGA 
CTAGAAAATG 
CAAATTAGAC 

CAGCTCACTG 
AACCTGGGAC 
TGGGGTTTCA 
CCTCAGCCAC 
AATTCTTACA 
ACTTAACATG 
ACCATCTCAG 
AGAACAAAGT 
TCTGTCACCT 
GGGCTCAAGC 
ACCACACCCA 
CTGGTCTTGA 
TACTACACCC 
CTTCAGAATA 
ACTCCACTAC 
TGTTGCTCCC 
ACAGATCACT 
CTTGTTTGTG 
TACTTGTTTA 
GTGGCAAAGT 
TGAGTAGGGA 
ACCCCCACGC 
TTTTAATCCA 
TCATTCTTAT 
TTCTCTGCTT 
TTATTGTCAT 
TCTCTCCCTT 
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GGGGCTTGAA 

TCTGCTCAAC 

AGCTATGAAG 

GCCAGGAAGC 

GTACCAAAAC 

ACTACAGGGC 

TCAGAAAACC 

CAGTTAACTT 

ATCAAATAAG 

ATCAGTACTT 

GCCGAAAAGA 

CAAAAGAACA 

CACCCTAGAG 

ATCCCCAACG 

CCCGTGCGGA 

TCCTGTTTGG 

TCCAGGTTCA 

AAAAGCATTC 

GACAGCCGGG 

GGCACTCCCT 

AGCAGGTCCT 

CCACCCCCGC 

GCAGGACCGC 

TTTCCCCAGC 

ACGGGGATGG 

AAACTTCCTG 

GAAGGTACCT 

AAGGGGTGTT 

TTAAATATTT 

ACCAACTTAT 

AAATTAATTA 

CATAATTGAA 

ATGATTAAAC 

TTAGGTAGTT 

CTAAGACAGC 

GGATGGAGTT 

CAGGTGGAAA 

ACCCCAAGTT 

CCTGGGAGGC 

TATAGCTATA 

AACTCCTCCC 

CTTCGCAGAA 

TTTAAGCTTG 

TTGCTGCTTC 

TAATTCCCAG 

TCCCTCAACG 

ATACACCTGA 

TAAGCATGTA 

TGTAGGCAAA 

GGGCTGATAG 

AATTCATCTA 

GCACTTTGGG 

CCAACATGGT 

GCAACTGTAA 



CAGTGCAACA 

ATGAAATTTT 

TGGAGACATG 

AGGCCGTGCC 

TCAAAGTGCT 

AAAATCACTG 

AGATTATTTA 

CTAGTAAAAT 

ATCAGATATG 

TGTAGAGAGG 

GAATTGTTCA 

ATTGCCAAGA 

ACCTCCACCC 

CCACTCTTTC 

TCCTGCTGTG 

AGATGACTGG 

AGGAGCCCCA 

TAGTTGGGGG 

GGAGGGGGCA 

CACGGGGTCT 

CCAAAGTTAG 

CGCGCCCCTA 

GGTCTGCAAA 

TCTGGCCGCA 

CTCCAGAAGT 

GTGAAAAGCA 

GCTTGTGAAA 

GTTGATTGGG 

ATGATTTTCA 

GTCTTATTTG 

AGTCATAACA 

TTATCTGACA 

ATATTGAGGC 

AGACATTAGC 

CACTGGCCCA 

TATGATAAAG 

TTCCCTAAGG 

CATCATG C C A 

TTTTCTTAAC 

AACTTCAATC 

CACAAACCCC 

ATAAGCCCGC 

CATTTTGGTG 

AGAGCTCCGG 

TAACAGTATG 

AAGTTTGGGA 

TTTGCTCCAA 

TGCAAATTTT 

CAGATACAAT 

TACTATTCCT 

TGTAAAATGC 

CGGCCGAGGA 

GAAACCCCGT 

TCCCAGCTAC 



CATGGCTGGG 

ATTATTCAAC 

AGCTCTGCCA 

ATGCCTCATT 

GTGCTGAGGC 

TCAACTAAGA 

TGTTCTTTGT 

AAACGTATTA 

AATGTAACTT 

CCTCTTAATT 

GTTCAAACGT 

GTGGGGAAAG 

CAGGTCTCAC 

GCGCCCCCAC 

GGTTTGCTCA 

GGAAAAAACT 

GGCTTAGCTC 

AAGGGAGTGG 

GGTCCTGGGG 

GGACGCAGAA 

CAAACTCCCA 

GTTCGCCCGC 

AGCATCAGGA 

CGTCCCCGTT 

CACCCTACAG 

ACAGGTCTTT 

CACTAGGTGA 

GAAAGTAGCT 

AAATTCAATC 

ACTTAGAAAT 

TTAACCAATT 

AGTGTTTCAC 

CTGCTCCTAA 

AGTTGGGAGG 

CCTAAATTCA 

TCTGTGGCCA 

TGGCACATGC 

TCATTATAAT 

AAATTATAGG 

AAATAACATC 

ATAAAAGCAC 

TGTCCCTCAG 

TTAGTTTGTA 

CTATAATAAT 

GGATGCCACC 

ATTATTGCCT 

ACCTTTACAT 

GGCAATTCAA 

AACATTGGAA 

TTTTTTCAAT 

AAAAATTGGC 

AGGCAGATCA 

CTTTACTAAA 

ATTAGAGGCT 



ACTCATTTAC 

CTCTAATGCA 

CCAAAGCCCC 

CTTGTCATGT 

CGGCGTGTGA 

TTAGAAG CAG 

AACCTGAAAA 

TTAGCTCCTA 

AGAAGTGAGT 

ACACAGCACA 

TCAAAACTAA 

GCCCGAGGTA 

CAAAAGTGGG 

CGCCCAACGC 

GCCTTCTCGG 

GCACAGCTGA 

AGCTCAAGTG 

GCGGTTCCAA 

CGAGGGACCC 

AGTAGGGAGA 

AGCG CAAAGA 

AGCCCTCGGA 

GGAGAAGCGC 

AAATCTCCGC 

CTATTGCCTA 

CAGAACTTTA 

TCCAGTGTCC 

TCGCAATGTT 

ATACATTTAA 

ATAAAGCTTT 

AGATCCTACT 

AAACTTTACA 

CCCCAGACAC 

GGATGACAGA 

GGCCCAAGAC 

AAATATCCTG 

CCAACAACAC 

AGAATTTACA 

TAAGACCATG 

ATCCTGTCAC 

CTTGAGCTCT 

AGTGTATTAT 

GTTCTTTGCT 

CTCCTCGGTT 

TGGGCAATGG 

TAGACATTTC 

ATCTAG C AAA 

GAAAATCAAA 

ACATGTAGAA 

TTTTGGTAAG 

CCAGCTCAGT 

CCTGAGATCA 

AATACAAAAA 

GAGGCAGGAG 



ACTTGTAAAC 

GTGTGATGTT 

GTGTACCATT 

GTAAAATGTG 

CCCACAGAAC 

CTGTAGTACT 

GAGTTATATA 

CCTCCCTATG 

GCATTG CTTA 

TTGCAAATCA 

CATATACTTA 

GGCCTCTCTC 

TGGAATGGTG 

ATTCGTTCTG 

CAAG CACTCA 

CATTGGAAAT 

AGGAACTACG 

AAGTCACTCC 

CTATCTGCAG 

GGGGCTTGCG 

AAAAGCTAGT 

CTCACGCAGC 

CGGCCTGGCT 

TTCTTTTGGG 

GGCTCAGGAG 

GTTCTCTCTC 

CCCTTGGTTT 

CTGATCTGAA 

AAATTTTATC 

TTCATTTTGT 

GAAACACGTT 

GTATTGGGAT 

ACTGATTTAA 

AGAGAGCGGA 

TACCCTAATG 

GAGAAAGAGA 

AAAAGCCTGT 

TACAGTTTTG 

CACAGTTTAA 

TCAGATACAG 

GTAAAGAAGT 

TGTGCTTCAA 

CACTATCACA 

AAAGGATCCA 

GATTTTAAAA 

AAACAATATT 

TTCAACAGGC 

CAGGATATCA 

TATTGATGAT 

ATATAATTAG 

GGCTCACGCT 

GGGGTTCGAG 

TTAGCCGGGC 

AATCGCTTGA 



AATGAATATT 

TAAGAATCAT 

GAATAAATTT 

GATACACGTA 

ACTGTGCTAC 

TGAAATAACA 

ATCTGAATTC 

C CT AGTG AAA 

CATGTTCATT 

ATAAAGC CTA 

ATTTTCCAGG 

AGGAGCCTCC 

AAGAATTCAG 

AGGTGGAAAC 

GGGAAGAACT 

AAACCCGAGT 

AGATTTATTT 

GCAGAGCCGG 

TTCAGTGGTA 

GATTGGGTTG 

TTCGATTTTT 

AAGCGCCCCT 

CGCGGGCCCA 

GGGCGGGGAA 

ATGCCCAGTA 

TCCTACAGCA 

TTAAATCCTG 

CTTTAGATAT 

TCAACCTTAG 

TTTTTGATTC 

CCACAGCCTT 

TATCTGGAGA 

TGGGTAATTG 

AAGGCTGTCA 

CCACCCTAAG 

AAGGAGGGTA 

CTTCAAGTTC 
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51841 AGGTCAGGAG TTCTAGACTA CTCTGGCCAA CATGGTGAAA CCCCATCTCT ACTAAAAACA 

51901 AAAAATGTTA TCCTAGCCGG GCATGGTGCC TGTAGTCCCA GCTACTCAGG AGGCTGAGGC 

51961 AGGAGAATTG CTTGAACCCG GGAGGTGGAG GTTGCAGTGA ACTGAGATCA CGCCACTGCA 

52021 CTCTAGCCTT GGTGAGAGAG CAAGACTTGG TCTTAAAAAA GAGAAAAGAA AAATGAAATT 

52081 TCAGCATTAT AGAATAAAAA TGTTTCCCCT TCCCCCCAAA CTTTAAAAAA GCAGAAGTCT 

52141 GCATCATAAA ATGGTCTTTG CCAATGTTAT TTTTATTATA ACAAAGGAAT CTTGCAAGGC 

52201 TACCAGATCT CAGCAATTGT CACTATGTTC TGTAAAAATC ACTTCCTAAA ATGTCTGAAT 

52261 TGACTGCTTG TCTCATTTAT TTGTTTCTCG TGTCATACTG CAATGGATAT CTGTCTTGTT 

52321 AGTATAAATA TTTGTGCATT TTGTTGTTGT TAAAACAGCT TTTTTGGCCT GTCTTCTTCC 

52381 ACCTATGAGG TAATATAAAA CTCATGTTTA ACACTTATTT TTGTAGGAGG ACAAGCTACA 

52441 GACAAAACCC CTCAGACACT GAGTTAAAGA AGGAAGGGCT TTATTCAGCT GGGAGCTTTG 

52501 GCAAGACTCA CATCTCCAAA AACCGAGCTC CCTGAGTGAG CAATTCCTGT CCCTTTTAAG 

52561 GGCTTGCAAC TCTAAGGGGG TCTGTGTGAG AGGGTCATGA TCGACTGAGC AAGTGGGGGT 

52621 ATGTGACTGG CAGCTGCATG CACCAGTAAT CAGAACAGAA CAGGGATTTT CACAGTGTTT 

52681 TTCCATACAA TGTCTGGAAT CTATAGATAA CATAACCGGT TAGGTCGGGG GTCAATCTTT 

52 741 AACCAGACCC AGGGTGCAAC ACCAGGCTGT CTGCCTGTGG ATTTCATTTC TGCCTTTTAG 

52801 CTTTTACTTT TTCTTTCTTT GGAGGCAAAA ATTGGGCATA AGACAATATG AGGGGTGGTC 

52861 GCCTCACTTA TTCACCCCCT TTGAGAATCT CACTCATTAG TGGGAGTTCT CACTTTTATT 

52921 CTCACTACCT ATGTCTTCTT GAAAGACAGA TTGATAATGA TTGATATAGT ACACTTGTGC 

52 981 TGAAGCATTT TGGTGAGCTA AGGTAGTGAT GAAGCTTTTT ATCATTTGGA GAAGTACAGG 

53 041 TAGCAAACAA GGAAGCAGTA AGCAGGTTTC TATTAATATT ATAACTCCTA TTATAAGAGT 
53101 TTTAAATCTT CTTAGCACTC GGAACCATTT TTCAAACATG GCCCCAGAAA CAAATCCATA 
5 3161 CCACACCTAC ATGGGCACAT GTGCCACTTT TGTCATATTT CTAACTATGT CTTCAACTAC 
53221 TTGCCCTTAA TCATCTATGT GTAGAC AG C A ATTAGTAAGG TTAAATTTCC TACAGACCCC 
53281 TCCTTCAGTT GCTAGCAAGT AGTCGAGAGC CAATCCATTT TGATAGATAG CATTTTGCAT 
53341 CTGAGTTTCT TGCCAGGCCA CAGTAGTCAG GGCTCTGCTG GTCTTATTAG TAATTATTTC 
53401 TAAGAC AG CT TGTAAC CGTA TGATTCAGTT GAGCATGTAA ATGGGGGTCC CATATCCCCA 
53461 CAAGCCGTCT TGTGCCCAAG TAGCAGGCCC ATAATATTGT ATGATTCTCT CAGGGGGCCA 
53521 TTCATTATTT TTCCAATTTT CTATAGCTAT GCTTTTTTTT TTTTTTTTTT TTTTTTTTTT 
5 3581 TTGCGGGAAG CATATACAGG GAAGCCCAGG AGTTTGCCTG TCTTTATGGG CAGTAGGAAG 
53641 AAAGATGGTT TAATAGTGTC AATAACACAA CTACCTGCCC ACTGGTCAGG TAATTTGGCA 
53701 TAAG C TGTAT GCCCACATAT CCAGTATAAT CCAGTGGGGG CTGTCCAGTC CCGGTGGGAC 
53761 TCTGGGTGGG TCCACACAGT TTGCAACTTT GGGAATTTAC TAAATAGATT TTTCTTAGTG 
53821 TGGTTTGAAC TCCACTAGGT GGCTGTTTTT ATAGTACTAT TATACAGTTT TTGCCCAAGG 

53 881 CAGCTGAGTC TTCCCACAGG AAGGGTGAAG TCCTTCCCCA CTTTTGCTAT ACAGTATTGT 
53941 CTAATGATTG AGGCTTTTAG GACC CAGAAG TTATCAGGGT GAGTCTTTTG AGCTGGGAAT 
54001 TTATCAGGAA CTGGGTCTGT AGGTACTAAT TCTCGTGCTT CCCATGGCCA TTGATCTCCC 
54061 ATTACAGTTC CTCCACATAC ATACATAACA TGAAGTGACA TTGAGAGACT GGGCTACATG 
54121 CTCAGCTAAT TGCAAAAACA AATTTCTTGT TTTTCCTGGA ATTTCTAGTA C TGGC AC ATT 
54181 CAGTTCATCA TAAGAAGGTT TGAAATACTG GCTCAGGGGA GCATTTATAA ACTTCTCCTC 
54241 AAAC CACCAT ATTTACTCAA GGATCCAGTC CAGCCCCAAC TATTTCTAAG GTTACACGAT 
54301 CCCCTTTTTT CCAGTGAGAA TCAAGGGGGT TGGTTATTAC TAGTTCTAAG GGGTTACACT 
54361 GACCACTGGT ACAGGAAGGG CCACTTTTCC CTTTCTGAAG GTGGACAGGA TTCTTTTTAT 
54421 TTTTTAAC C A AGTTGCCTAA ATGACACAAG ACCAGTATCT ACATTTATTT CCACGCAGTC 
54481 TTAATTCATG ACAAGCGTAC TTATTTTCTG CCATATAGCC TCTTTCCTAA TGAACAGAAC 

54 541 CACATCCTAT TTCTAACTTA TTACTATTAA TGACAGCACA GGCATCAAAT TTCAAGGTGA 
54601 CTTGTTTGGG CATTCCTTTT TCTTCTGTTT TGGCTAACAC TTTACTCGTA TCGTTTATGA 
54661 ACCCCCACCA GTCCTCAGTC CTCAATCTTA TTTCAAAAAC TGTGGTCGTG GGAGGCTCAG 
54 721 ATGGGTCATA ACACACATCA GGTTGGTCAT TTCTTGGGCT ACCTGCCTTG TATAGAATAG 
54 781 CATTATACAA ACAAGTTATT TTTAGAGTCT TTGTACACTT ATAATAACCA TAAAATAATA 
54841 AGACTGTAGC AACTTTTTGT CCTACCTCAG TGACTTGATG TATACACTGG GAACAG CCCT 
54 901 CAGTCTGAGG AAGGTTAGTT GAAGTCTTTA CTGTGCAAGT CCAAATTTTA AGGAAAATGA 
54 961 GTCCCTTGAT GAGTTTTCTC ATGTTTCGGC CATGCATGGA CCAGTCAGCT TCCGGGTGTG 
55021 ACTGGAGCAG GGCTTGTTGT CTTCTTCAGT C ACTTTG C AG GCGTTGGCGA AGCTGCCACG 
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5 5081 TACAGCTCAC AGTCTACTGA TGTTCAAGGA TGGTCTTGGA AGTTGGGCCC ACTAGAATTA 

55141 ACTGAGTCCA ATACCTCTAC TCAGTCACTT TCAACTGGGC TTTCTGATAC CAGGAGCAAG 

55201 GTGGCAGGTT TTAGGGTGTT GCAAATTTCA ATGGTTATGC AGGGATTTTC ACATAGCAAA 

55261 CTTTGGTACT TGGTTAATCT AGCATTTGTT AGCCAATGAT GTATTTATTA AAGTCACCAC 

55321 AGCATGGAGG GCCTTTAAGT TTAGGTTTTG TCCAAGAGTT AGCTTATCTG CCTCTTGTGC 

55381 TAGCAGGGCT GTTGCTGCCA AGGCTCTTAA GCATGGAGGC CAACCCTTAG AAACTCCATC 

55441 TAGTTGTTTG GAGGCCCAGC CTCGGCCAGG GCCCCACAGT CTGGGTCAAA ACTCCAACCG 

55501 CCATTTTTTC TCTTTCTGAC ACATAGAGTG TAAAGGGTTT TGTCAGGTCA GGTAGCCCCA 

5 5561 GGGCTGGGGC CGACATGAGT TTTTCTTTTA ACTCATGAAA AACTCATTGC TGTTGGTTGT 

55621 AATAGATGTA GTTTATCCAA TCTACATTTT TATTAACTGT CACCCACCAA AATATTGACT 

55681 CAAATCCTGC AGCTATTTGA TTTTGGGATT TAAATTGATC TGCTATTCCC TGTGGGACTC 

55741 CAATTGCATC TAAATAGATG TGAGAGTTGA AAGACACATA AGGGTCTTCT CTTGCTTTAC 

5 5801 GATGTCTTAT TTTTCCTCCC TCTGGTTGAT GAAATGCTAG GGTGAAAGGG ATAGCCAACT 

55861 GGACTAAAGT ACAAGTGCCG CTCCAGTTAT TTGGCAGAGT GCCCAGTAAA GGTCCACCAC 

55921 AATACCACCA CACATCCGCT TGGGGATGAA CAAAGGCTGA CTGATTGAGA AGCTCCTGAA 

5 5981 AATTCTTAAG CTCACTGCAT CCCTTCAGGT CTCCAAGGAA TGCTAAGTTT CCTCCCTGTC 

5 6041 ATGAGAGACA AGAAGTGAAC TTAGTTTTGG GAGATGGAAG CTGGATGGCC CTCAGGGGTT 

56101 GACCTGCAGG GTGCTGGACT TTGGGATATA GCAGAGAGAG CTTGGCACGA CTTATTACTC 

56161 CAGGCTGTAG CATCCTGGAA AACAGTTACC ATGCAGCCCA TGCCTGGTCA ACAGGAGGAC 

56221 CACCTTAGTG GAAAGGGGAT AATCTGGCCC TCTGGCCTGC CATGTGCACA AGCATAACAA 

5 6281 TTGGTTTTGT TTAATGTGTG GACAGAATAT TTGATCCATT CCAACTGGGC ATTTGCATCT 

56341 TGGTATCCTG CTTAATTATC AAAGT TTGTT TTAAGTCTTT AACTTCTATG ACCCTCTAGT 

564 01 AAAATGAATG TATGATTTTA GGAAATTACA AAAACCGGTT GGGGCAGTCC ATCCTCGCTC 

56461 TTTAGTGGTC CACACAACAT TCGACCAACT ATGG CAT AAA AGCTCTACAT CAGGGGGCAA 

56 521 GACTCCTCGT TGACACTGGG GTCTTTATTG AAATCTCTCT GGATTAAATG GTCTCAGTTT 

56581 ACTAAGGCTC AGTCTGAGGA GAGTCAGGAG GGACAGAGGT ACTTTTCTGA AGTACAGAGA 

56641 TGTCTTCGAC TTGGCAAGTC CCCACAGGGT ATAACAAGGC AAG CATTAAA TTCAATAGTT 

56701 TGAGGCAAAA TTGACTTGGT TATGTTAATA ACTAGATGGT CAGAAATAGA GTGAGGGAAG 

56761 AAGAAAGAGT AATAGAATAG ATGAAGGAGT TAAATTTTTC TTAGCTTTAG TTTGGTAGGG 

5 6821 TTTTCCCCTG GGACTATGGC CCATGACTCT GGAGGGGGTG GCACTTTCTT GACTCGGGTG 

56881 TGATGAGTCC ATCCCTTTTT CACCGTATGA ACAACAGTCT CGGTGGTTAG CAGCACAAGG 

56 941 TAGGGTCCTT CCTAGGCTGG CTCAAGTTTT CCTTCTTTCC ACCCTTTGAT GAGAACATGA 
5 7 001 TCTTCAGGCT GGTGCTGGTT TACAGAAAAT TCTAGGGGTG GTACATGTGC TAAAAGACTT 
5 7 061 TTAGTTTTGA GGGAAAGGAA AGTGGAAGAT AAACCAAGTA TATAACTTTT AAGAAGTTGA 
5 7121 CCTTTTGTTT TAAATGTGGG GACATCAGCA GTGGACTTTA TAGTCCTTGG TGCCTTCTTA 
5 7181 CTGAGAAATT TCCTTTAGCA CCTATTTTTA TTAGTTTTTA GACCAAAGAA AGTCAAATGC 
5 7 241 CATTTTATAT TTGACAACGC TTCTTGTATG TTT AT AC C AG ATAAGCTAGA TTTCACCTTT 
5 73 01 ATATTGGTGT GTTATTAATG TTAAACTTAG TTTTAATAAA ACTCTGTAGA CATATTTATT 
5 7361 TGATTTTTAA TGTCTGACCA TAAGGTAAGA TTTTTATAGA CTTTTCTTTA ACCTTTTATA 
5 7421 ATTTTTGTTA AAGAACAGGT TAGTGCTTTA AGAAAAACCC GTTGTGTTTT TATTTTAATG 
5 7481 TTCAGTTCAC AGAAAAACTG TATGATACCC CTTAACTTTA GCCAATATGT TTAGACACAG 
5 7541 AATTTTCTTT ACAATTAAGG TTTCAAAACT TGCTTAAACC TTCAAAACAA TTTTTGTAAC 
57601 CTTTTAATGT AGGTAAAAAT C C AC ATTCTT ATGCATCCTC ATAATC CTTT TACCAAAGGT 
5 7661 ATATTTTACT TTCCTTACAT ACCTTGCACA TAAACTGTTT ATTCAATAGT TTTACATTTA 

57 721 GAAGGAGGCC TAATTACTTT TAAATTATAC AACATTTCTT GCATAAATTT ATTTTTCTAA 
57 781 CACACATTTT TTTCATGACT TTCACAGACA ATTCTTCGAC ATGCCTCAAC TTTCTGACTT 
5 7841 ATTGCAAACA TCCCTTTCTT TAAACAACTA GTTAATTTAT CTCAGGACAA GGATTTTCCA 
5 7 901 TACAACATTC TTTTTTATAT AAATTCTGCC TCCTCTTTAT TTCCTTTTTT TTTTTCCGAG 
5 7 961 GATGATAACC ATTCTTTTCC AAAG CGAACT TCTTTTATGT CTGTGGACTA GACTGTCTAA 
58021 GGCCACAAGA TTAGAAGTTA CTATAATACA TGTTACACTG TTAACTTTTA GCAAACTTTA 
58081 CTTTTGTTGA AAACCTTGTA AGTTTGGGAT TTCAATTATC CTTTGCTATT AATAAGACCT 
58141 TATTTAGTCC AAATTAACTT AGAATTGGTA TAG ATGG CTT TTTTTTTTTT TTTAATTACC 
58201 TGGGAGGAAC CATCTATCCT CCTGTCCTGA AGGGAGTTCC TCCTAGGTCT GGTCAGAGCT 
58261 TTGTATGGTA ATTAAGATTT AGATCCCCTG TTAGGAAACC TGCCGGGTTA AGAGAATTTT 
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AGTGCCCACT 

CAGGAAAGTG 

CTGCACAATA 

CGTAAGATAC 

GCATGGAGGT 

AGCACTGACT 

GGACCCTGCA 

CTACCTGATT 

AATCCCAGCA 

AGCCTGGCCA 

GGTGGCACAC 

CACAGTGATG 

GAGTGACACT 

GCCAAATGCC 

CCTAAAATTA 

TCTTGGGAGG 

CATGGTGAGA 

CCTGTAGTCC 

AGACTGCAGT 

CCATCTCCAG 



TTAGGATACT 

TTTTACTTTC 

CGCGGGTTAC 

TGACAACAAA 

CCTGCCACGT 

CTCCAGATCT 

GCCTAAGTCG 

AGCTCCCTGA 

ATGAGAGGGT 

GTAATCAGAA 

GGAATCTATA 

GCGGTGCCGG 

TCTTTGGAGG 

AACAAAATTT 

AGAAAATAAA 

TTAGTTATCA 

CTAATAATAA 

GAGGAAGAGG 

TGACCTCCCC 

CCTCTCCACA 

CACACATATA 

ATAAATCTAG 

ACACACAAAA 

AAGGATCCCA 

AATCAGACGA 

CCACATAAAA 

GGAGGCCAGG 

ATCTCTACAA 

TACTTGGGAG 

TGAGATCACA 

AAAAAAATTA 

GAACAAGAAA 

GATGTGTCCT 

GTATTGGTGT 

CCCTGGGAAT 

TGGGCTGACT 

TAGAGTTGGA 

ACACACACAC 

TTAGAGGCTT 

TCCATGAAGG 

ACTGAATATG 

AGAATAGCTT 

CTTTGGGAGG 

ACATGGCGAA 

ACCTGTCATC 

GAGGTTGAAG 

TTGTCTCAAC 

TGCTAAAATG 

GATGCAATGT 

CCGAGGCGAC 

CCGTGTCTCT 

CAGCTACTCA 

GAGCAGAGAT 

AAAAAAAAAT 



TCTGAACCGG 
TTCTGTTAGC 
TGGGTTAAGG 
GTGCCAGTTC 
GCTGCTCTGG 
GTGTCG CTC A 
CCTAAGGGGC 
GTGAGCAATT 
CGTGATTGAT 
CAGAACAGAA 
GATAACATAA 
GCTGTTTGCC 
CAGAAATTGG 
TCAAAGTCCT 
AATGATTGTA 
GTTAAAATTC 
GAAGGGAGGA 
GTGACTCCAG 
TCCCCAAAAT 
TATGTATACC 
GCTGACCTCC 
GTCATACTCA 
AAGAAATTGA 
TGGATAAAGT 
AAGGATGAGT 
AATGTATGAG 
GCGGGCCGAT 
AAAATACAAA 
GCTGAGGCAG 
CCATTGCACT 
TATATATATA 
TTTAGATACA 
TTTGCATTTA 
GGATATGGAG 
ATTTTCCAAA 
GATATCCTTC 
AG CAATGG AT 
ACACACACAC 
TCTACATCAC 
GAGATTGAAG 
CAGAAAAAAG 
TTTCAGAAAA 
CTGAGGCGGG 
ACCCCATCTC 
CCAGCTACTC 
TTAGCTGAGA 
AACAACAACA 
AGCACCCAAG 
TGGCTGGTCA 
AGATCG CTT A 
ACAAAAACGT 
GGAAGC TGAG 
CATGCCACTA 
AAAGAGAGAG 
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AGAGATGCAA 

AG C ATGTTAG 

GTACCAACTT 

CTCCATGCTC 

AGCACTGACA 

CCTGGTCCCT 

CCATTTCCCA 

AGAGGTGGAA 

CCGAGATGTT 

AGGTAGTGCT 

ATGCATTCCC 

GTTCCCTTGT 

GCTGCTCAAA 

CCCCACCACA 

TACAGGGTGC 

ACCGTCTACC 

TGTACCGGAA 

TGGAGGTAGC 

TTGGAGTCAG 

CATTTTGCCA 

AGCTATTCGT 

AAAACAAAAA 

GCCGGGGACC 

CCTTCCCAGC 

AGGGTCTTTG 

CCTTATAAAA 

GAGTAGACAC 

TGGGAGAGAA 

ATACTTTGAT 

GCCAACAAGT 

TGGACTGAAT 

GATGGTACTT 

GTGTCTCTCA 

CTCATGCCTG 

AGTTTGAGAC 

GGCCAGGTGT 

TCCCTTGAAC 

TGATAGAGAC 

CTGATGCCTG 

AGTTCAAGAC 

AACTGGGTGT 

TTGCTTGAGC 

AACCTGTCTC 

CTGCCTAAGC 

GCCACCAACA 

CTTGGACTTC 

ATAGTATTTT 

ATCACAAGTC 

ATGATGAATT 

GCTTTACAAC 

ACTAGCTACT 

AGAATGTTTA 

CTGGATTGAA 

CTGGTGGCCA 



TATTTAGGGT 

ATTCTGGGTC 

TGGAAGCCTG 

AGCTTGGCAA 

GGTTCCATTC 

CAGTAATCTC 

CCAAGAGGTC 

TGAAGAATGA 

AGCTAACTCA 

TCAGCCTCAG 

ACAAAAATAA 

CCCGCTCGCA 

CCTCTAGGGG 

GTGTCTAGGG 

TCTTTTAGTT 

TTGTCCCAAG 

TAAATCAGAC 

TCTCAGCAGT 

CCACTCAGTG 

GGCAAACGTT 

GTCTTGTGGC 

TGCCTGTCCT 

ACGCCCTTCC 

ACTTTCCCCC 

CACTTCATCA 

AGGAGAAATG 

AGGGAGAATC 

ACCTGGAACA 

TTCAGACTTC 

TTGAGGTACT 

TGTGACTCCC 

GGAGCTGGGG 

TGATGAAATT 

TAATCCCAGC 

CAGCCTGGCC 

GGTGGTGCAC 

CCAGGAGGTG 

TCCATCTCAA 

TAATTCCAAC 

CAGCTTGGAC 

GGTGGTACAC 

CCAGGAGGAG 

GGGAAAAGGA 

CCTACAAGCA 

AGTCAGGAAG 

TGAGCTTCCA 

ATTACAGCAG 

CACGCCTCCA 

ATTTTTAAGA 

GTGATAATAG 

GGCCACTTGT 

ATTTTACTTA 

CAGCACAGCT 

AGCAGCAATG 



TCAACAAGAC 

CTTCATCCTA 

GATCTTCATC 

GAGTATCTGT 

CCACTAGGGT 

AGCATGGTAG 

TGATGGCTCA 

ATCCTGGGCT 

TGAGAGC CAG 

CAGTCCACAT 

AGTTGTTGAA 

TGGCATGTGA 

AACAGTAAGA 

TTGAATGTTT 

TTGCCATTTA 

GACAGAAGAA 

CACACCTGGG 

TGGGCAAAGC 

GCCCAGGCTC 

TGTTGTGTGC 

AGGCCAGGGG 

CACCGTGGTC 

CTTCCCCACT 

TCCTGTATCA 

GTTAAGATAA 

TCATACACAG 

ACCATTCAAG 

GATTATCCCT 

CAGCTTCCAG 

TTGTTACTGC 

CGTCGCAAAA 

CGTTTGGGAA 

CATGCCCTTA 

ACTTTGGGAG 

AACATGGTAA 

GCTTGTACTC 

GAAGTTGCAG 

AAAAAAAAAA 

ACTATGAGAG 

AAAATAGTGA 

ATCTGAGGCT 

GCTGCAGTGA 

GAAAACAGTG 

CAAAAAGGAC 

AGAGCGTTCA 

GAACTGTGAG 

CTCAAGGTAA 

GAAAAAGACT 

ACTTTTAAGG 

AATGCTCTGT 

GACTATTGTG 

ATTTTAATTC 

CGAGTCTTTT 

G C AGGTAGT A 



TGAACTTCTG 

ACCCCCTGTT 

CCCTCATGAT 

CTTCTCCTCA 

GGCACCCTAT 

CACAATCGAA 

TCACATAGAC 

CTGCTCTTCC 

AAACCAACTG 

TCTAGGAACC 

GTCCTAACCA 

TAGGAGTGTG 

CGGGCAGGTT 

ACAGCTCCTG 

TAGGCAGCTG 

GGCTTTCTGT 

CTTAGAGAAA 

CAAAAGTGGA 

TCCTGCAACC 

TCTTCTGCCA 

AGGTCTTGGG 

CCTGGGCACA 

TCCATATCAT 

GGACCTGTGA 

GAGTGGGCTC 

AGACTGACAC 

TCAAGCAATG 

CATTGCCTTC 

GACTGTGTGA 

AGCCCCAGAA 

TTCATATGTT 

GTCATTATAT 

TTAAAAGAGA 

GCTGAGGTGG 

AACCCCATGT 

CCAGCTACTT 

TGAGATCACA 

AAAAAAAGAC 

GCTGAAGCAG 

GACCCCCAAC 

CCAGCTACTC 

GCCATTGCTG 

AGACCTCTTT 

ACCACATGAG 

CCTAGAAACT 

AAAGTTATTT 

CTAACATAGT 

TCCCTAAAAA 

GATCTGACAA 

GATGACAGAA 

CACTTGAAAT 

ATTACAATAG 

AGAGGGAGAC 

CACACACAAG 



ACTCCTTTCC 

CATGCCATAG 

AATGAGTGTC 

TGGGACGGTC 

ATGGTCTGAG 

AAGGGCTAGG 

TGAAGGAGAT 

TAGGCCTGTC 

CAGGCTGGCC 

CTCATAATAT 

CCAGTACTGA 

GCTAATTTCT 

GTGGGTCTCC 

AAGCCACAGT 

GTGTTAACCA 

ATCCCAGGTT 

GAGTG CAAGG 

TGGAGTGGGA 

ACCCCAGTCA 

GTGTGCTCCC 

AAATGCAACA 

GGCCTGGGGG 

TTAAAGGGAC 

ATGTGGCCTT 

TAACCCAACA 

CTATAGAGAG 

AGTCTGGGGA 

AGAAGGAATC 

CGATAAATAT 

AACTAATACA 

GAAACCCTAA 

TTAGACAAAC 

CAACAGGCCA 

ATGGATCACC 

CTACTAAAAA 

GGGAGGCTGA 

CCACTGTACT 

AATAGAGCCA 

GAGGCTCGCT 

TTCTAAAAAT 

TGGAGGCTGA 

TCCAGCCTGG 

TTCTCTCCTC 

CACATAGTGA 

GAATTGGCCA 

TTTTTTTAGC 

AGAAGGGATG 

TTAGTCTGAG 

GTTTGCAAGA 

ATCTTTCCAC 

GTGACTGGTG 

CTACATGTAG 

AGGACTCACC 

AGGCAGATGA 



CTACCTCTCC 

CCACCCTGTG 

CCATTCAGGT 

ACATTCACCC 

TCCAGGCCTT 

CACGGCAGCA 

TCTGAAGAGC 

TTCCTCTCTC 

TCAGGCACTT 

GGGTTGAAGT 

AATGGGAAAA 

TCAGTGCCTG 

AACCCCATGA 

GGGTGTGTGT 

ACTCAATTAG 

CTTGCCTTGG 

TTTTATTAAG 

AAGTTTTCCC 

AATTCCGCCT 

CTGGACGTCC 

TTTGGGCAGG 

TGGAGCCCTA 

CATGCCCTTC 

ATTTGGAAAT 

TAAAGGGTGT 

AAAATGTGGT 

TACCAGAAGC 

AAACCTGATG 

CTGTTGTTAA 

GTAGG TACTA 

CCCCCAGTGT 

TCATCAGGAT 

GGTGCAGTGG 

TGAGGTTGGG 

TACAAAAATT 

GGCAGGAGAA 

CTAGCCTGGG 

GGTGCTGCAG 

TTAGCCCAGG 

TTAAAAAATG 

GGTGGGAGGA 

GCTACACGAG 

CTTCTCTCCA 

GAATGCTGCT 

GCACCTGGAT 

GACTAAGTCT 

AATTATGGAG 

CAAAATTCGA 

GCTAGAGAAT 

ACTGTTCAAA 

TCTGAGGAGC 

CTAGGGGCTA 

AAGGTGGATG 

TACAACACAT 
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64 801 CCTTCCCAAA CCTGGAGATA AGCTCACCCC ACAATCCCGC CGCTGAAATA GAGTTGATGT 

64 861 TACCAATGTG CATTTTTATG TCCTTTTCCA TACAGAAAGA TCATTCAACA AGTACTATGG 

64 921 TACTTAAAAA ACAACATTCA ATTCATTATT ATGACAAAAT TAAATTAATA GCTCTTCCTT 

64 981 AAACTTTTAA ATTCAATTTA CAATG CTTAC TATTGGCATT TATTAATCTA CCAATTTTTT 
65041 CCCATAGAAC CCATAGAACA AATAATCTAC CAAATTTTTA ACATTCATTT TTGGCAAGGC 
65101 TTTTGCAATT TGACGAACTT TAAGAAGAAA ACTTATAAAT TGCAATTTTT AAATCTGACA 
6 5161 TACTGGACTT TTAAAGTATC CAATTGACTA ATGAACAAAA CTGCTCCAAA TTTTTCAATT 
6 5221 CTTAAAAATC TTAAGACAAT ACTTAATATG GCAAATCTTA ACTTCTTAAA CTTTGTAAGA 
6 5281 ATGCTAATCA ACTTAGATTG GTATAAAGTT GAGTTAAAAA TCACAGGATA CATCATCTCA 
6 5341 GCTATAAGTT TTCATGAGTT GAGTTTTTAC AATCACTTGA AATGCTTAGA ATAGGAAATA 
65401 CGTATAAATT ATTTAACATA AAATATTGTT ACAAAACCTC TGGAGTGTCA GTTTCTCTGG 
65461 CCAGACTTTA TGCTGCAGCA CCTTTGCCTG AGTTCTTGTC CTGCATCCAG GAAGAATTAG 
6 5521 GTACAGAGGC AAGAGT CAAG AAGATTAGTT TTCCAATAGT TCAGCTCACC TAGTTAACTC 
6 5581 CTGTTCACAA TCTTCAAAGT TATCAGAAAC CTGCAATTGA GGGTTATAAT CCATTCTTTG 
65641 CAGAGTTTCA AAACAAGACA ACATTTGTCT ATGAATGTTA AAATGTCCTA GGGTAGTCAC 
65701 AGTCAAAAAC ACAATTGACA AAGAAATTTA GTCACCTCTG TGATTTACAA TAGCCTAACA 
6 5761 CAATAACTCT AATTATAACT GATGACACAA ACTCAGATAT CAGAACTCTA GAAATCCCCT 
6 5821 ATAATTTTGG AACACATATT CACAGTTTTC ACTGAAATAT GACCTGAAGA TCAAATATCA 
65881 CCTTATTTCA ACAATCCTAT ATAACTAAAC GTGTCAAATG ATCCTGTTTA CCTCTCCTTT 

65 941 GGATACTCCA GGGGCCCTCT GTAGCATCCA AAAGTTAGGG GTTAGCAAAG ACAATTTTGA 
66001 AGCTGTAAAG GCTCAAAACA CTTAATGAAC CTCTAGTCAT ATCTGTTCTC TACTCACTAA 
66061 ATG CTAGTAG CACCTCTCAG TTGTGGCTAA GCTGGGAGGA TCTCTTGAGC CTAGAAGTTT 
66121 GGGGACGCAG TGAGCTATGA TTATGCCACT GCACTCCAGC CTGGGCAACA ATGCAAAATC 
6 6181 CTGTCTCAAA AACAAAAACA AAAAACAAAT TGCCTATGCT GTGGTTATCT CACAATTAAT 
6 6241 AAAAAGGAAA AAAAAAGTAT GCAGTCTTTG TAGGTCCTTG GGGTTTGTTG GAACTCAGAA 
66301 AACAATACCC CAAAATAAAG AC CGCAGAAG CCAAAGTTTT TCTCTGATCT TCTCCTGCCC 

66 3 61 TCCTGTCTCT GAGTCCCATT CTCCCCGGAG TCTAGCCATA GAAATGAGAA TTCCTCTTCC 
6 6421 TCAAGTTAGG TCATAGAAAT CAAAACACCT TTTCCCCAGA GCCCAGCCAT AAAACCTAAA 
66481 AATATTACTC TAACTTTCCC TCTGTTTTTC TGTGTAAAAA CTGGCCATAA AGAAATTATC 
66541 TGAACTACCT TATTTGATCA TAGATCACCA GACCGCATTC CAGAGAGGAT CCAGAAGGAA 
66601 GGAATGCTGC ACAGAGAGGC CAAGAAGAAT CTAGACAGAC AGGCCTTGCT GGGTTTCCCT 
66661 ACTCTGTTTA TTAGCAATCC TATTTCTACA CGGCGGCCCA TACTTTGTTG AATCTAAAAA 
66721 ATAAAAATGG ACAATTTCCC CTGTACATGT TAATACACAT TAATAAATTG GATATAAATT 
66781 GGATAATTTA TTAATATACA CATTAATAAA TTGGATGCAG CCGGGTGCAA TGGCTCACGC 
66841 CTGTAATCCC AGCACTTTGG GAGCTGAGGC GGG CAGACCA CGAGGTCAAG ACCACCCTAG 
66 901 CCGAAATGGT GAAACCCCGT CTCTATTAAA AATACAAAAG TTAGCTGGGC GTGGTGGCAC 
66 961 ATGCCTGTAG TCCCAGCTAC TGGGGAGGCT GAGGCAGGAG AATTGCTTGA ACTCGGGAGG 
67021 CGGAGGTTGC AGTGAGCCGA GATTGCGCCA CTGCACTCCA GCCTGGTGAC AGAGTGAGAC 
67081 TCCGTCTAAA AATAATAATA ATAATAATAA TAATAATAAT AATAATAATA ATAAATTGGA 
67141 TGCATTTTAT CCTATTAATC TTCCTCTTGT CGGTGGTTTT CAGCGACTCT TCAGAGGCCA 
67201 AAGAGTAAGT TTTCCCTTAG CCCCTACAGG TTCTTATGTT TAATTTGTTA CTCTCATTTA 
67261 AGACATAATT AAAGTGGCTT CTCCATGAAG ATTATTTCTG CATCCATTAT TTGGTAAGAT 
67321 TGGCCGTTTT CTCCTTTGAT CTCTACTTCA CACTGACCCA CATAAAACAT CACTGCCTGT 
6 7381 TTTTTTGTTG TTGTTGTTTG GAGACGGAGT CTTGCTCTGT TGCCCAGGCT GGAGTGCAGT 
67441 GGTGTGATCT CCGCTCACTG CAAGCTCCGC CTCCCGGATT CACGCCATTC TCCTGCCTCA 
67501 GCCTCCTGAG CAGCTGGGAC TACAGGCACC CACCACCAAG CCCGGCTAAT TTTTGTATTT 
67561 TTAGTAGATA CGGGGTTTCA CTTTGTTAAC CAGGATGGTC TCGATCTCCT GACCTCGTGA 
67621 TCGGCCCGCC TCAGCCTCCC AAAGTGCTGG GATTACAGGA GTGAGCCACT GCGCCCGGCC 
6 7681 CCGTTTTTTT TTTTTTGGTT TTTGCATGTC TTCTCCCTTT TACTGTAAAC TATTTCCACT 
6 7741 ACCAGCGTAG TTATCATTTC TACTGCTTAA TAATTGTTTT GGGGAAGTGA ATGCATCAAC 
6 7801 CCACATGAAT TTCTTGTCTA TTTGACAATT TATTCTCTTT AGGAATAGTA TTAACTCCTA 
67861 AGGTCCTGGG AGCCAGTCTC TGTACTTGGC TGCTCCAGGG TCCTACTTCA GTTTCCCAGC 
6 7 921 TTCTCAGTAC TGTCACTGTC AATTGTGGGT AATAATTATT TTTGTCCACC AAAAGACTCT 
67981 GTATGTGAAT GAGTTTTGAA ATCTGCTGAG TAATACAGTG TCAACCCAGT TAATGATTTG 
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6 8 041 CCGGGCGGCT TGATCAGGGG CTGTCCAACT ACCGGCATTT TGATTTGGAG CGTCATCTAG 

6 8101 TGTCTGAAAG CACAAACAAC ATCCTACATT GTAAATGCCT TTGGCTACAG AGATTGAAAC 

6 8161 CAAAGCAAAC CTATGTTTTG AATTGTTATT CTTCAGCAGT TCTGCTAGCC TTGAAAAATC 

6 8221 TAAAAGTTAA AAAAAAGCTT TATATTTCAT TTTCTGCCTA AACTCTTTAA AATTGCTAGT 

68281 TGACAATTAG ATATTTTCAA TTTAATGAAA TTTTTTTTTA GTTCACAGAT TAATACACAA 

6 8341 TGGGGGAGGG TTCTTATTCT GTTGGACTTT TACATAACCT CCACTTTAGT GCAGTCTGCT 

684 01 TTATGGGGTC TTGTTTGAGG TGTGTGTGTG TTTAAGGGAA TGTGGTTTAC AATCAAAATA 

6 8461 TTGGGTTGCT CTTAGGCACA TTGTAAAGTC ACACACCTGT ATTCTTATTG ATACATAATG 

68521 ATTAATAACA TTATTATTAC AGCCTGATCA CCATCATTAT TGATATATCT AAATAATGAA 

6 8 581 TTTTATAATT TTGCTTCCTG T CAGGCAAG A GCCAATTTCA GTGCTACCAT GTTTGTATAG 

68641 CAGTATTTAT GTCTGTCATC CTCAGTCATT TTACTTCACT TGTTCTTAGC CAAACGGCCG 

68 701 AGAAGCGATG GTCATTTTAC TTCAAAAATG AAAAGAATTA ATATTTTTAC GTTTCCCTTA 

68761 AAGACCCTAT GTTTAACCTC CACTCCCGGG TAAAATGGTC TAGTCCCTCC TTTTCATATC 

68821 ATCTCTGATA TCTTTTGCAC AG C CACTATT ACCTACCGTT TTCTAGATCC CTATTCTTCA 

68 881 AACACCACCA TGAAGGTAGA GCCTGTCTGA ATTATTTTCT TGTCCCGTGA ACTCAGTACA 

68 941 TTGTTAGGCT TCTTGAAGAT GTTGATCAGT TGTTTGTGGA GTGAATGAAT CAGCTAGCAT 

6 9001 GATTTTTCTA GACCACTGAG ACAAGTGTCT AAGACACTTG TTCCTTCCCA TGTTCTTGCC 

69061 TGCCTGTGCA ATCCATG CAG TCTCATGGCT TCCCAGTGCC TCAGAATTAT CCCCTGTCAA 

6 9121 ACAGGCATTA TAATTTCTGT CCACTGAAAA GGACAAAAAA CTAAGTGTAT AG CTAGAAGT 

6 9181 TAAAAATTAC CGGCCAGGTA CTGTGGCTCA CTCCTGTTAT TCCAACATTT TGGGAGGCTG 

6 9241 AGGCGGGCAG ATCACCTGAG GTCAGGAATT CGATACCAGG CTGGCTAACA TGGCGACCCC 

6 9301 GTCTCTATCA AAAATGTAAA AGTTAGCCAG GTGTGGTGGC TCGCACCTGT GGCCCCAGCT 

69361 ACTCAGGAGG CTGAGGCAGG AGGATCGTTT GAGCCCTGGA GGTTGAGGCT GCAGAAAAAT 

6 9421 AGGAATATAC TCTCTTTCAA GAGTTCGTGG TTTTGACTGC CACCTAGCGT ACATCAGAAA 

6 9481 AACCGCATGA CATAGGAAAT GCCTGTGACA GAGGGGTAAG GTGAGAGAGG TTGATGAAGA 

6 9541 ATGTATTGAA GGAGTGAAAA CGCTTCCATC CCTCTACTTA CTAAATATAT TAGTTAAGTA 

6 9601 GTTGGGGCAT ATTTTAATTC ATGCATTTTG TAGATAGAAA AACAAAAGTT TTATTCTGTT 

6 9661 TGATTTAGTT GATACTTTAA TATGTGTGTG TTTAGGATGC ATGATTTATA ATCAGTCTGC 

69721 AGCACTTCTT GGAGAAGTCT GAATTCTCAT TCTCCATTTC CTTATTGGCA ACGTGAGAAT 

6 9781 GATTACAATG GTGGTTGTCT CATAGAATGC AGGGAGTCAG AATGAAAATA GTCCATATAA 

6 9841 TGCCTGGTGC AGAGGAAGGG TTCAGTTAAC TGTCTGTATT AATATTACTG ATAACAGTCA 

6 9901 TGACAAACAA AAG CTTAAC A ACAACACCAC CAACAACAGT TGCAGAATTG AGCCACCAAT 

6 9961 TTGCACACAA GATTGTAGGT AGGATGTTTT AGAAAAGTTA TTATTTAATA TATGTATATA 

7 0021 TTTTTGTACT TAAAATATGT CAGAGGTTGT TCTAAGAACT ATTTAAATGT TAACTCCTTA 
70081 ATCCTCATAA TGACCCATGA AACAGGTAGG CTTATTATTG TCTCTTTACA TGTGAGAACA 
7 0141 CTGAGACACG AAAAGGTTTA TTAACTCACC CAAAGTCACA CAGCTGGTAA AACGGCAAAA 
7 0201 TTGAATTTGA ACTCAGACAT TCCAGGTTCC AAGACAGTCT AATTATTCTT TTGACTAATA 
7 0261 TACTAAGCTG CCTCTGTATT TTTCCTTGAT TACTTTGTAA AAGTATGAGG AAAATATAAG 
7 0321 TGCTTCAAGT AACCATGAAA AATATAAACA ATCTATGTAT CAACTGAAGC ATAATTACAA 
7 0381 ATCCTTTGAT AAGCAAACAT AATAAAAATT TGATATCAAT CAAAACTTTC ATGTAATGTA 
7 0441 AGCAGGTTGA GATGAATTCT ATAGTAAAAA AGTGCAGAGT GCTGGAATAC CATGCTCCTA 
7 0501 ATATATTGGC TAGGCACACC TGCCTGCTAT CAAAGGTATG CACACACCTT G GAT AC AG AA 
7 0 561 AGTTGGGACT GGGTAGTTAT GTGAGTGTCA TCAGAATTCT TTCCCACTTG GGAAAGAATT 
7 0621 GTCCATCATA AGCTTGGATG ATGGACAAGG AGTGAGCTCC CAGAACAGTG ATGTGGGGAT 
70681 ACATCCTCAC ATCACAGTGA GAATGAGTGT TCTAGACTGT TTACACACCT ACCACTCCTA 
7 0741 AATG C AC AC A TATAATTGCT TG C AC AC AC A CACATACACA CTCATCTCTT CTCTGGTGGT 
7 0801 CCAGCTCTAT CTCTTATCAT TAGGCTTCTT GGGGCTAGTA CCTAGGGCCT GTATCCTTTC 
7 0861 AGAGGCAGCT AAGGGAAGCA CACATAATTA GAAAGAATGA ACCAGCTTGT TGGATTTGGT 
70921 CTCTTCGCAT CCAGCCCTCC AAGTTAAGGA GAG TAC CATC TTTCTTAGGG TCACCAAAGG 
70981 AAAAAAAAAA AAAAGAAAGA AACAGAAGGA TATCATACAG CAAGGATCTA ATGCAAATAT 
71041 GCCTCAAATG AGAGGCTACT GTGTGCTGAT CCCAATCCCA GGAACTGTAT GCACATTATC 
71101 TAATTTAATC CTCACTGTAT TTCTGGGAGT ATTATTCCCA TTTTACAGAG AAGGAACTTG 
71161 GCAGGGTAAC CAAGCTCATG AATGGAGAAA CTGGGATTAA ATATAAAGCT TCCTTGCTCC 
71221 AGAACTGCTG TCTTTCTGCT CTTCCACACT ACCAGCTCAG CTGTGCTCTC TACATGCAGG 



Figure 9 (Page 22 of 74) 



SIIRSTITIITF RMPPT mill C oc\ 



WO 98/14466 



PCT/US97/17658 



111/162 



71281 
71341 
71401 
71461 
71521 
71581 
71641 
71701 
71761 
71821 
71881 
71941 
72001 
72061 
72121 
72181 
72241 
72301 
72361 
72421 
72481 
72541 
72601 
72661 
72721 
72781 
72841 
72901 
72961 
73021 
73081 
73141 
73201 
73261 
73321 
73381 
73441 
73501 
73561 
73621 
73681 
73741 
73801 
73861 
73921 
73981 
74041 
74101 
74161 
74221 
74281 
74341 
74401 
74461 



CAGTTTTACA 
GGGAACTTTT 
CTATGGTATA 
ATAAATAAAT 
ATTAAATAAT 
TATTTTTTGG 
AAGAACCACT 
CTTGAAGGCA 
CAGTGGCTGT 
CTGTCCCACC 
TATTTCTCCT 
GTAAAAAACT 
TTTCTTCCAA 
AATATACGTT 
GTAAAAATGT 
TTCTGAAATG 
CCAGTTTCCT 
TTTTGCAATA 
TTATATTTAT 
AGTGTTGTGA 
TCAGCCTCAT 
CTATTTATTA 
TTTGAGGGCG 
AAATATTTAG 
GAGCAAAACT 
AACAGACAAG 
TTGGTCTCAA 
GTGGCTGTTA 
CAAAGTTCTG 
GAACCCTACT 
CTGAGGCAGG 
ACCCCTGTCT 
GAAAAACCAG 
TTGAATATAT 
CATTTTGCTC 
TGAAACAAAA 
ATATATTGCA 
ATCAGTTTTG 
TTTTTTTTGA 
GCTCACTGCA 
GCTGGGATTA 
GAATTTC AC C 
AGCCTCCTAA 
AATAGACTTT 
AATACAGTTT 
GCTTTGTATC 
CGAAATTGAA 
TATATGTACT 
AAGGAGGAAG 
TGGTCAGGGA 
TGACTTATTA 
TTTCCATCTC 
TTTTAGATAG 
ACAGTGTTGT 



AGTTTCAGAT 
GGGTTTACTT 
TATGATAAAT 
ATTAATTTTA 
TAATACTCAG 
AGGCCTGATA 
ATTTTAGGCT 
AACGTTTAGC 
GTCTTTTCTA 
AGGTATAAGT 
GGTGTCCTGT 
AGACATTAAA 
TATAGTCATT 
TGCATCTTGT 
GCATATCCTC 
CTTTGACATC 
ATGTCACTTA 
TATTGTTCCT 
GTATTTATTT 
TCATAGCTCT 
GAGTAGAGTA 
TGCTCCTACT 
TGTTAGGGAA 
CCTCAGGTTT 
GTGGCTCTGG 
AGCCCCTACA 
GCAGATAGCA 
TTATTAGCTT 
GCCTATACAG 
CTAAGGCTGG 
AGGATCACTT 
CTATCAAAAA 
CTGTCACCCT 
CTTGGCTGTT 
TGCATTTTTA 
CTAGTCAACC 
TTACAATATT 
GGTGGGACGA 
AATAGAGTCT 
ACGTCCGCCT 
CAGATGCACG 
ATGTTGGTCA 
AGTGCTGGGA 
TTTTTTGTTG 
C CATGGAAC A 
TTCCAGTTTT 
CAACCAAGTG 
ATATATAGTG 
CAGAATCACA 
AAGGATGTAT 
GGCAATACAA 
TATGACAAAA 
TCTGGACCCA 
CCAATAAGGT 



TAGCCTGGGA 
TCCATTTTTT 
ATATGGCTAC 
TAATATTTTA 
CTTTGTTTTC 
GTTTTTAGGA 
GTTGTCTTCT 
CAGCACATTA 
TCGATTTCTC 
TCTTGAGAGG 
GCTTAACAAG 
AAATAATGTC 
GTGTCAGGTT 
GCTTTATAAC 
ACAATTGACA 
ATTTGAAAGA 
TACAATTATA 
TTTGTAATAC 
TTCTGGACAG 
CTGCAACTTC 
GCGGGAACTA 
GTGTGCTTTA 
TACAGATGCA 
AATCTAATTG 
GTTATATGTT 
ATCTTATTTA 
ACACTAACAC 
CAT TAATTGG 
GATTTAGTAA 
GCTTGGTGGT 
GGTGCCAAGA 
CAAAGAACTC 
CATTCCTTAC 
TGAGTCTCTC 
ACTTTTCTAC 
TATAATATTT 
TTAACTGTGT 
CCACATCCTT 
CGCTCTGTCA 
CCTGGGTTCA 
CCACCATGCC 
GGCTGGTCTT 
TTACAGG CGT 
TTGCTCACAG 
CCAAC CAG AT 
TCAGAATGGC 
TCAAAGTACA 
AGCTTGTGTA 
ATTAGGTCAA 
ACTGGAAGAG 
TAATAACTTT 
TCCTTATTAA 
ATAAAATGTA 
ACCACTAGCT 



CTTCCAGGGT 
CTTCATACAT 
ATATGAACTA 
AAGGTTATCA 
CAAAGTGATA 
GTGTAAAGAA 
GTCTTATTTT 
ACATTTTATG 
ACACTGTATG 
ACACACTGCT 
TGCTCATTAA 
AACCAATCTA 
ATGTACTTAT 
TGCCTTCATA 
AATTCTTATC 
AG CTTGAAGA 
ATGGCAATTT 
TCTCTATGTA 
AGTCTTGCTC 
AAACTG CTGG 
CAGGCGCATG 
GTATATTTTC 
GTAACTTTGG 
TTGGCCATTT 
AAAAAAAAGT 
GGCTGAAAAT 
TTACTCTTTG 
TGAGTCAGGA 
TATTAGGTTA 
TCACACCTAT 
GTTTGAGACC 
TAATTGGCAT 
ACCTGTCCTA 
TCTAGCCCCA 
CAGGGTTTCC 
ATGATGTGTG 
CCTCAATTTG 
AATCTGAACT 
CCCAGGCTGG 
AGTGATTCTC 
GAGCTAATTT 
AAACTCCTGA 
GAGCCACCCC 
GCTTGTTCAA 
ATCAGGTTGC 
TTCTAAAGGT 
ACATTCAGGA 
TGTGTCAATG 
AGGAAGATAC 
GAAGGGAAAA 
TAGGGTCATT 
TTTATTAAAC 
AACATTAAGT 
ACACGTGATC 



TTTGAATGGG 

ATGTAATATA 

TATAATCACA 

AATAAATATT 

AATG CCTATA 

GTCCTGATAT 

CCCAGCTAGA 

TTTTTATTCT 

ATGGTTATAT 

AGGCTGATCT 

GTGTGTAAAA 

TTGAAATTTG 

TCTGATGAAG 

TAGACACAGA 

CTTTGAGGGT 

ATAAGATAGC 

CAAAATGTTA 

TTTATTTATA 

TGTTGCCCAG 

GCAAAAGTGA 

CCACTGCACC 

TGTTGTTTTC 

TCTCAGCCCT 

GCCTTCAAAG 

TTATGGGGCT 

ATCCTGGAGT 

AGGCAGGCAC 

AAAAACAGCT 

GCTACATCCA 

AATCTCAAAA 

AGCCTGAGCA 

AGTAGAAGGA 

ACAACTCCTC 

TTACTGCTGT 

AGACCCTGAA 

TGTAAATAAA 

TTTGTGGCTT 

TTCCCTTGGA 

AGTGCAGTGG 

CTGCCTCAGC 

TTGTATTTTT 

CCTCATGATC 

GCCCGGCCAG 

TCTTATTTCA 

TATGGAGTTG 

TCTGATTCAG 

AGTTAAAAAC ' 

AATGATTTAA 

GGGAGAATAA 

TCAGATATAA 

TTTTCTATAT 

TTCTACAAGT 

CAGAGTTACT 

ATTGACCATT 



TTAGGGAATG 
TAACATAAAT 
TATATGCATT 
AATATAAATA 
TTT AG CAAAA 
CTAAATGTTT 
CTGGTAAATA 
TTTGTGCTCT 
TTGTCTGTAT 
TAGTTTTTAT 
ACACAGCACA 
CATTTC CATG 
ACTATTGCCT 
TTGAGAAGGT 
AGGTTTGACT 
TG TTAATG AC 
GGTAAATATA 
TTTTTAAATT 
GTTAGAGTGA 
TCCTCCTGCC 
CAGCTAATCA 
TGCAACCCAT 
TGAGGTGAGG 
ATTGAAATAT 
GAAGCCAGGC 
CCCTGTATTG 
TGCCAGTGGG 
TTAAATCATT 
AAAGATGACA 
CTTTGGGAGG 
ACATAGTGAG 
AAAAGTGAAA 
TCACTATCCT 
TTGGACTTGA 
GAGTGTGGCA 
AGAATACACA 
TCTTGAGGAC 
GGTCATTCTT 
CGCAATCTCA 
CTTCCAAGTA 
AGAAGAGACG 
TGCCCACCTC 
AGGTCATTCT 
AAATTTGAGA 
ATAGTCAAAA 
AGCTCTTAGG 
ATGACTGACA 
TTCATTAATG 
AATATGTATT 
AGTTGTTTAA 
TAAGAATTCA 
GAATGTTTAC 
TTCACGTAGG 
TGGACTATAG 
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CTAGACTGAT 

ATTTAAAAAA 

AATAGTTTAG 

TCATTCTTTC 

TTACTGTCTA 

CTCTAATATA 

AACTTTTTCA 

TTCTGGCCTC 

TAATGGCTTG 

ACTAGAAAAA 

AACCAGGACA 

TAG CTATTTT 

CATAGAATAA 

TGCTCTGTTG 

CAATCTCGGC 

GGTTCAAATG 

CACGCCAGGC 

GCTGGTCTCG 

ATTACAGGCG 

TTCTTTTTTT 

TCGGCTCATT 

GTGCCACCAC 

CTAGGCTGAT 

GGATTACAGG 

AGTATGATGC 

CCCTGTCACT 

CACTAATCTC 

ACAAATGTTG 

TTTCACTTTT 

TCTAAGGCTG 

TGGGGAGTTG 

GGGCAGACAA 

AGAGTAAAAT 

AGGGGATGAG 

GTTCTGGTAC 

AATACATTGT 

CAACTCAAAA 

AGGTATGGTT 

AG CGTACAAA 

GCCTCTAACT 

TATGATCTCT 

AGCTCTTTAA 

CATCTATGAA 

AGCCCCCCAC 

TGCTTTGGAC 

GAATTCACAG 

AGGTGGAAAC 

TGGGAAAAAG 

GTGGTCTCTG 

GTTGTTTTTG 

CCTTTCCTGA 

TGAGGTGGAA 

GGCGACCAGA 

ACGCTGCTTC 



TTAAAATGTT 

GAATGTCAAC 

ATATATTAAG 

AATGTGACCA 

GTATTGCCTT 

GCTTGGAAGG 

ACACACTTTA 

CACCATGTTG 

AGGTCTGGAC 

TCATATTAGA 

C CAATTT ATA 

CATGATGGAT 

GATCTAAGTG 

CCCAGGCTGG 

TTCTGGAGTG 

ATTCTCCTGC 

TAATTTTTGT 

AACTCCTGAC 

TGAGCCACCA 

TTCGACTGAG 

GCAACCTCTG 

CCCCAGCTAA 

CTCAAACTCC 

CGTGGG CCAC 

AGGTGCTTCA 

CTGAGGAATA 

ATTAGACTGT 

GACAAATGTT 

CATTTGTCAT 

TTCTCTTTAA 

TAATATT AC C 

TTTACTTTGC 

AAGAAGTAGT 

AAGAGGTGAT 

TTTTCTTGTC 

TCATCTTAAA 

GATATATTTG 

ACTGAAAGTA 

AATAAATAAA 

GCCAAATCTA 

ATCTTGAGGG 

AAAGAGCCCA 

AAGAGGGATT 

CACCCCGGAC 

TGGCCGTGGA 

AGGACCTGTG 

CATTTCGCTT 

AACCTTCAGG 

ACTGCCCCAG 

CGTTTGTGGA 

CCAAACCCAA 

CAAACACAAC 

GCACCAATCA 

ATCGGCGCTT 



CTAAAAGTGT 
TGTCTTTTTT 
TTAAATAAAA 
CTAGAAATCT 
ACATCATCAG 
ATATGGAGAA 
TAAAGGATCT 
CCAGGAGGTT 
TTGAGATTTG 
GAACTGAATC 
GTGAAAGAAG 
ATACTTAGCT 
AAATGTTTAT 
AGTGCAATGA 
CAACGAGGCA 
CTCAGTTTCC 



CTCAGGCGAT 
AGCCTGGCCT 
TCTCACCCTG 
CTTCCCGGGT 
TTTTTGTACT 
TGGCCTCAGG 
GGGGCCCAGC 
ATTGTTTATA 
GCCGGTCTAA 
TAATTCCCAG 
ATTTAATAAA 
TGAACTCTTA 
CACATATTTT 
TTTGTCCCTA 
CTTTAATCTC 
GAACCTTAAA 
TTTTCAGCTC 
TTTCTGTGTT 
AGTCAAGAGT 
AGTTCCCAAC 
ACCTACAATT 
TAAAAAATGC 
TTGGCTTTTT 
CCAGACCTCC 
ATTTCTCGCC 
TGATAGTTTC 
CCTAGCAAGG 
ATCCTTGTCC 
TTACTTCCCT 
TTTTCTTCAA 
GGCAAAGGAG 
ACTTCCTTCG 
AAAAATAAAT 
GTGATTTGGT 
TTGGGAGCAG 
GAGCGCGCCT 
TGCCACTTGT 



AAAATACACA 

TTTAGCTTAT 

TATCTTAAAA 

GGAAAGTATT 

GTACCCCATA 

ATATTTTTGC 

AGAAAAGGGT 

GGGGACAAGA 

CATATAAAGA 

ACAGCGATTA 

GTCCAGTTAC 

GAGTTTTAAA 

TTATTTTTTT 

GGCAATCTCG 

ATCTCGGCTC 

TGAGTAGCTG 

GTAGAGATGG 

CTGCCCGCCT 

AAGTGACATG 

TTGCACAGGC 

TCAAGCGATT 

TTTAGTAGAG 

TGATCCGCCC 

CTTATATTAT 

CACTTTCCAT 

GTGTTTTTCC 

AGGACATAAG 

ACAATGGGGT 

TTTGTAGGTT 

CATGAAAACA 

AATATGAATC 

AAGAAAAAAA 

GTAGCAAACT 

ATCAACAACA 

AAATTTTGCT 

GTGTTTTATT 

TGGAGATTGT 

TTCATGGGCT 

TTGTTTTCTT 

TGCAGGCTTA 

TGCCTTACAC 

TGTAGAGAAG 

AATGTCTTCA 

CTCATGAACC 

CAGTCCACAG 

TGTGAAGAAA 

AAATAAGGGA 

CGAACAGGTA 

GAGTTGGGGG 

GAAGAGCATG 

GCGGGGAATT 

CGCAGCGGCT 

GCGCTCTATA 

ACCCGAGTTT 



CCAGGTTCTG 

TTATTATATG 

TTAATTTTAC 

TATGTGATTC 

AGTAGGCTTT 

GTTGCTTTTA 

TGGTTACATG 

TTCTGGGTGG 

GATGTGATTA 

AATTTACATG 

CTGGTAATCA 

TGAGAAGGGG 

TTTTTTTTGA 

GCTTCTGGAG 

ACTGCAACCT 

GGATTAGAGT 

GGTTTCACCA 

CAGCCTCCCA 

TTCTTATATT 

TGGAGTGCAG 

CCCTTGCCTC 

ATGGTGTTTC 

CCGAGTCTCC 

TTCTTTTACT 

AATTTTGTAT 

ACCACTGCTA 

CACACAAGCA 

CACCCTTAGT 

CCCTTTTGAC 

TATATTTGAG 

TATAATTATA 

TAGCAATTAC 

TTAGAACAGA 

GATCTTATAA 

ATTTAAAAAA 

AAAGTCAGTT 

CCTATATGGT 

GAAATTCATT 

TGAAAACATA 

AGGGCTCTCC 

AACTCAGAGG 

TGAAAAGGAT 

AATCAAAGAT 

CCCTCCCATC 

TTCCTGTGCG 

CAGAATTATC 

AG CATGTGCC 

ATTTATAAGA 

AATTGGGGAC 

AAGCCCGAGG 

TTAATATTTT 

CAGAGCCTGC 

TATACAGCGG 

TTGATTCTCA 



AAGATTTATC 

TTGAAGTGAT 

TTGTTTCTTT 

ACATTCTATT 

TTAGATAATT 

AGTTTTGCAT 

TTTCTCTGTC 

CTGGATGTCC 

GATTGAGTCG 

TCGATTTATA 

AGACGTTTCA 

GTTCATTGCA 

CATGGAGTCT 

TGCAATGAGG 

CCACCTCCCG 

TGCCTGCCAC 

TGCTGGCCAG 

AAGTGCTAGG 

GTTCCTTTCT 

TGGCGTCATT 

AGCCTCCTGA 

ACCATGTCCG 

CAAAGTGCTA 

ACAATATATT 

AATTCTTATA 

ATTCATCCAT 

GACAATGTTT 

CTAAAAGATG 

TTTCCCACAA 

CAGAAATTGT 

TCAAATATAT 

TTGGGGTCGG 

ATAGTTTCAG 

TAAATTACAT 

ATAAATTTCA 

GCTTTATTTG 

AACTTGCGTA 

TCTATATTGC 

TTATCTCAGT 

CTTGTTCCTT 

GGGAC CTCAG 

GCCCCACCCC 

TTAAGTCTGT 

CCGCCCTAAT 

ACTG CACGAA 

ATGAAAATTT 

CAACCACCCC 

AAAACAGAAA 

GCCTGGACGC 

CTTCTGAGAT 

TCCCCTTTTG 

CAGCCAGGCG 

CCCTGCCCAG 

ACATGTCCGA 
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GACTGCTCCT 

GGCGGCCAAA 

CATCACCAAG 

AAAAGCGTTG 

TCTCAAGAGC 

CTCCTTTAAA 

GGGCGGAACC 

CGGCGCAACT 

GGCCACTGTA 

GAAAGCTGCC 

CAAGCCTAAG 

AGGCTCTTTT 

TGCCTTTTCT 

AGAGACGAAT 

TTTATTGCGG 

TGGCAGTCCC 

TTAGTCTAAA 

GGGTTGGTCC 

CGTTG CTTAT 

TATATTAATG 

CTGGAGGTAG 

CATTCCTCTG 

CCACAACAGC 

TGTGCCACAT 

CG T ATCG AAA 

CCTGTTTAGA 

CAGTCCAAAT 

GATAAATGGG 

ATGGTGCTTC 

GGCACATTTT 

TTATTCTACC 

CGGATTCATC 

GCAGGGAAGA 

CATTACAACA 

CATATTTTTC 

GCCACCCTCC 

TCATGGACAG 

CCACAATTCT 

GCTGAGATGA 

TATCAAGCCT 

GGCATTCTGT 

AGTCATATAA 

CCCAGGCTGG 

AGCGATTCTC 

CGGCTAATTT 

GAACTCCTGA 

GTGAACCACC 

TTTATTATCA 

GCCATGTGTA 

CATCCACTGG 

TAATTAACTG 

ACTCTTCTTT 

AGATCTCTTC 

AAAGGCTGGG 



GCCGCTCCCG 

AAGGCTGGGG 

GCTGTGGCCG 

GCTGCCGCCG 

CTGGTGAGCA 

CTCAACAAGA 

AAACCTAAGA 

CCGAAGAAGA 

ACCAAGAAAG 

AAAAGTG CTG 

AAGGCGGCGC 

CAGAGCCACC 

TGTTCTGCCC 

TGGGTGACGG 

CTTCTAGGTC 

GCCCCAGGCC 

GGGATGTCCG 

GTTCTTCTAG 

TCTGTGTTTT 

TCTGGGATTT 

TGGCAACATC 

CTGTTAATTT 

GGCAATAGCC 

AACATGT AG C 

GCACAACTTT 

CAGACAGCAA 

GTTTCTATGC 

AGACATTTCT 

TGAATACAGA 

GGTGAGACCT 

ATCTC C AC AA 

CCAAGAAAGA 

AGGAGAAAAC 

CGGTTTAACA 

CCAAGACCAT 

ACGCTCCTAT 

TTGGACTGTC 

TAAGGTAGAA 

TTATGTGACA 

AATGCTACTT 

CATCTCACAT 

TTATATTTAT 

AGTGCTGTGG 

CTGCCTCCGC 

TTGTATTTTT 

CCTCAGGTTA 

GTTCACAGAC 

GTTATTGCTA 

TATAGAAAAA 

GGGTGCAGTT 

AGATGTTGTA 

TTCAGAATTT 

CACCTCCTCC 

CGCGGTGGCT 



CTGCCGCGCC 

GTACGCCTCG 

CCTCTAAAGA 

GCTATGATGT 

AGGGCACTCT 

AGGCAGCCTC 

AGCCAGTTGG 

GCGCTAAGAA 

TGGCTAAGAG 

CTAAGGCTGT 

CCAAGAAGAA 

ACTGATCTCA 

TGTTACTTAA 

GGTTGGAGAG 

CCTGACCGGA 

TGTGAACGGC 

GATTGGACTA 

TACATGACTT 

TTGCTTTACT 

CGGACGCTTT 

CAGCCCTGGG 

CTCATTCCTG 

CTTCCTCCAC 

CTTC CGCTAA 

TAGCCAGCCA 

CATTTAAAAA 

AGAAAACAGT 

AATAAAGGCC 

AAGCCTAGCG 

AAATTATGGG 

TGATTAATAT 

GAAAGGGGAG 

ATTCTCCCAT 

TGGTGAACCC 

TTATGAACTT 

CAATTTTGGC 

TTAGGTTTCT 

TTGTATTGTT 

AATGGCAAGT 

CACAATGCCT 

CATCACAAGT 

ATTTATTTAT 

CACGTTCTCG 

CTCCCGAGTA 

AGTAGAGACG 

TCCGCCCACC 

TCAAATCATT 

ATCTCTTACA 

AACAGTGTAT 

TATTAAACAT 

ACGTGACTTT 

TCCTGGTTAT 

TGTTTCTCCA 

CACGCCTATA 



TCCTGCGGAG 

TAAGGCGTCC 

GCGTAGCGGA 

GGAGAAAAAC 

GGTGCAAACG 

CGGGGAAGCC 

GGCAGCCAAG 

AACACCGAAG 

CCCAAAGAAG 

GAAGCCGAAG 

ATAGGCGAAC 

ATAAAAGAGC 

GGTTAGTCGT 

TGGCCGTGGT 

GGCTTTTCTC 

AGAAAAGACC 

AAAAATTTTC 

TCATTCTGTA 

GTGACTTAAA 

CCATGTTGTT 

AGGAGAGTGC 

TGGCAACGAA 

CCAAGGCAAT 

ACTGACAGGT 

TTTTGTCCTC 

TCGAAGTTCC 

ATTTGTACTA 

TTCGTTAATG 

TCTTATATTC 

GACTGGGGCT 

AGTGAGTTGA 

GGAGGCAAGC 

GGTTTAAGTA 

TCTATTTTGG 

TCATTTCTGC 

TGTTTTGTCA 

CAGGTTTCTA 

TTAAACATTG 

GTTCAACTAA 

ACTCCATTCA 

AAAACGGTAA 

TTATTTATGA 

GCTCACTGCA 

GCTGAGATTA 

GGGTTTCACT 

TCATCCTGCC 

TTTATTACAG 

GTGCCTGATT 

ATACGGTTCA 

GCATTTACAT 

AATAGCAGAT 

TCCATTTTTT 

TCTCAACATC 

ATCCCAGCTC 



AAGGCCCCTG 

GGTCCCCCGG 

GTTTCTCTGG 

AACAGCCGTA 

AAAGGCACCG 

AAGCCCAAGG 

AAGCCCAAGA 

AAAGCGAAGA 

GCCAAGGTTG 

GCCGCTAAGC 

GCCTACTTCT 

TGGATAATTT 

ATGGGAGTTA 

GAGGTTACAG 

GCTGGCGGAT 

GCAAAACAAG 

AAAAGTCCCG 

TTTAATTGGA 

AGTTTTGCCT 

GGTAGTCAAG 

GTGCAGGTAC 

GGAATGCATT 

CGTGGACCTA 

TTGAGCGTAT 

GCATGACTAC 

TTTAAACGTA 

TTAACTATGA 

GTTCCCTCTG 

GCTTCTTTTA 

TCTGGAGATA 

TTTGTTAGTG 

AGAGAGACAG 

ATTTTGTGTT 

TGTAAGGTTT 

TTCCCCCTTC 

TAGGCTAATA 

TTTTGTTCCT 

TGTTGTGTGC 

TACCTAAATC 

CCGCACTTTA 

GCTATTTTGA 

GACGGAGTTT 

ACCTCCGCCT 

CAGGGGCCTG 

AAGTTGGCCA 

AAAGTGCTTA 

TATATTGTTA 

TATAAATTAA 

GTACTATCTG 

TAGTCTCCCC 

AGAGCTAATT 

ATTTTTCCAT 

AAACAATTAA 

TTTGGGAGGC 



TAAAGAAGAA 

TGTCAGAGCT 

CTGCTCTGAA 

TCAAACTTGG 

GTGCTTCTGG 

TTAAAAAGGC 

AGGCGGCTGG 

AGCCGGCCGC 

CGAAGCCCAA 

CCAAGGTTGT 

AAAACCCAAA 

CTTTACTATC 

CTGAGGTATC 

CATTTAAACC 

GG TTTTGGG A 

AGCCAGTTTC 

CCCTGCTCCC 

TGGTGGAAGA 

CTTTTCTCTT 

TTGATGTCTC 

CTTTGTCCTA 

TAAAAAACAG 

GGGAGTTTTT 

CGATTTTGAG 

GGTTGCTTAT 

TTTTGTTTGG 

AGAGTGTATG 

TTTGACATCC 

AAATCTGGTG 

AGCTGCTCAA 

ATAGTGACCA 

GAAGACAGAG 

GTTAATTTTA 

AACATATGGA 

TTCCTCCCGT 

CGCTATAATT 

TTAGTCATTC 

TATCCTCAAT 

TGTAGTATCT 

TCTCATTACT 

GAGAGATCAC 

CCCTCTGTCA 

CACGGGTTCA 

CCACCATGCC 

GGCTGGTCTC 

GATTACAGGC 

TAATTGTTGT 

ATTCATCATT 

TGGTTTCAGG 

TTTGGGAGAC 

TTCTCTCATT 

ATGTATATTA 

AAAAAAAAAA 

CTAGGCGGGT 
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GGATCACGAG 

TAAAAGTATA 

GGCTGAGGCA 

CACTCCAGCC 

CTCACGCCTG 

AGTTCAGGAC 

GCTGGGCGTG 

CACTTGAACC 

GGGCGACAGA 

ATTGAACTTC 

TTCTTATCAT 

TTCTAATTTT 

TAATCCTAAC 

AACACATTCC 

TTAGATTCAG 

TCCATCACAA 

TTAAAATTGG 

ACTGTCAATG 

ATTTAACTGA 

ATTACATGCA 

CACAGACTTA 

TTTGGGAATG 

AAGATAGAAA 

ATATCTAGTG 

AGGGAAATTT 

TCCTACACCT 

ATAATTTGGG 

TTATATCATT 

ATTCAAATAA 

CAACTGATAA 

AACATAAAAA 

TTATTTATTT 

GATCTTGGTT 

CTGAGTAACT 

GTAGAGACGG 

CACCTACCTC 

TTATTCCAAA 

ACTCCTAGGC 

CATTGTGGTT 

C TAAAGGAT A 

CTAGCATTTT 
TTATTTTCAA 

TCAGGGTGAA 
TGTGTGTGTG 
TTTCACATCA 
GATTCATCTT 
CACAGTTCTT 
ACCCAGGCTG 
AAGTGATTCT 
TCGGCTAATT 
CAAACTCCTG 
TGAGCCACTG 
TGATGAAAAC 
TACTTTGGGG 



GTCAGGAGTT 
AAAATTAGCC 
GAGAATTGCT 
TGGGTGACAC 
TAATTCCAGC 
CAGCCTGGCC 
GTGTCACACA 
CAGGAGGCAG 
GCCAGACTCT 
TGTGTTCCTT 
CTCCAAGAGT 
CCTTCAATGC 
CTCGAATGTC 
TTTAATTTAT 
TCTTGCATAT 
TTGTTCACAT 
CCATTTTAAG 
AGGACACATG 
CAAAGGACAG 
CAGAGTTCCC 
T AC AC C ATTC 
TGACAAGGAA 
TAATTGTAGT 
ATAAGAATTG 
CTGTCACCTT 
GTTGATTTTC 
GGTGACATCC 
TTTGATTTTT 
TTCCAGAAAC 
TTCAACCATG 
CTAGAGGCTA 
ATTTTGAGAC 
CACTGCAGCC 
GGGATTACAG 
GGTTTCGCCA 
GGCCTCCCAA 
CTTTCATACA 
AAAGCTCTGG 
GAAATTCATA 
TCAGAAGAGA 
TTGAGCACTT 
ACACATTCTT 
GGACTAAAGC 
TGTGTGCATT 
AGGTAAACTT 
CAAGTTAGCC 
TTATTTTGAC 
CTGGGCAGTG 
CCTGCCTCAG 
TTTGTATTTT 
ACCTCATGAT 
CACCCGGCCT 
TACAACATTC 
TCATTTTAAG 



CAAGACCAGC 
AACCATGGTG 
TGAACCCGGG 
AGCGAGACTC 
ACTTTGGGAG 
ATGAAAATAC 
CCTGTAATCC 
AGGTTGCAGT 
CTCTCAAAAA 
TCTCCCTTAG 
TAGTCAGGAG 
CCTTTGGGGT 
TTCTGCAAAC 
AGAGTTAAAA 
GTTTTCTCAA 
AGCTTACTGG 
ATGAAAAAGA 
TTTTTCTGTA 
ATTAACATGC 
AAAGAAAAAA 
CAACAAAGGA 
ATAAATACAT 
AAGGTTTGTT 
CTCTCTTTTT 
CACAAAGGGA 
AATTGCCTTC 
TGATATTCTT 
AAATTAGTTT 
ACTGCTGATA 
AAAATTTATG 
CTTGTAATGC 
ATAGTCTCTC 
TCCACTTCCC 
GCACCTGACA 
TGTTTGCCAG 
AGTGCTAGGA 
CAGTGCTATC 
ATATTTTGGC 
C C AG AGATG A 
ATAGGGATTT 
ATTTACAATA 
GTCACAGCAC 
TTGGTGTCAT 
TTTTTTTAAA 
TGTTCCTCTA 
CTTCTTAATA 



GCGTGATCTC 
CCTCCTTAGT 
TATTAGAGAC 
CCGCCTGCCT 
TATTTTGCCT 
TTCACCAAAA 
ATTAGGTGTA 



CTCGCCAAGA 
GCAGGCGCCT 
AGGCGGAGGT 
CGTCATAAAA 
GCTGAGTCAG 
AGCCTGGCCA 
TAGC TACTCG 
GAGTTAAGAT 
ACTAAATAAA 
ATACTTTCAT 
AGGAATCAAC 
CTTAATCCAT 
ATGTTTCCAC 
ATTAGAAAAA 
TTTTGTTCAT 
CTTAGGTCTA 
TTCTTGCCTC 
CTCTTAGATT 
GAAAAAAAAA 
AAATTGAAAC 
AAGGGAGTTT 
GGGCAATAAA 
TTTGCAGAGT 
CCTGGTATAG 
AATTTGGGTA 
AGCTGAAAAT 
CAAAACTTAT 
TATAAAATAA 
AGCCAAAAAC 
ACATTGTTCT 
ATT ATTC CAA 
TCTGTCACCC 
CGGTTCAAGC 
CCAAACCCGG 
GCTAGTCTCG 
TTACAGGCGT 
ATGGCTACAA 
TATATAAGCC 
ACAGGCC CAG 
AGGGTACAGT 
TGCCAAGCAC 
TTTGAAGTAA 
TAAGGATGTA 
TTTAAAGTCA 
AAGAG CTGGA 
GAACTGATGC 
TTTTTTTTTG 
GGCTCGCTGC 
AGCTGGGACC 
AGGGTTTCAC 
TGGCCTCTCA 
TCTTTAATCT 
ATCTTTGGGA 
TCTGCCTGGT 



TGGTG AAAT C 
GTAATCCCGG 
TGCAGTGAGG 
AAAAAAGCCG 
GCAGATTACC 
TGAAAACACA 
GGAGGCTGAG 
GACGCCACTG 
TAAAAATAAA 
GG CTACCC AT 
CCAAGCAAAA 
TTGATTTATG 
AGATGAAACT 
TTTTCAATTC 
GCTCTTTAGT 
ATGAACCATT 
AATTTTACTT 
CACTAAGTAG 
GCATGCAATT 
CTTAAAAACG 
GCACTTCATG 
AACCATGGAA 
CATCTCAGTG 
CAGTTGGGGA 
AAGAGAAGAC 
AACTTTTATG 
ATTTAATTTC 
TTTTGAAAAA 
ATCAATGAAT 
TGTGTGATAA 
ACTTTCTGTT 
AGGTTGGAGT 
AATTCTCCTG 
CTAATTTTTT 
AACTC CTGAC 
GAGCCACCAT 
ATTGAAGTAT 
TGAGGGAAAT 
TGCAAGACAG 
GGCAACAACA 
TGTTGCTGAT 
GTGCCATTGT 
GCTAGTTAGC 
ATAAATTTTT 
GTCAAAATGT 
TTAATCCACA 
AGACGGAGTC 
AACCTCTGCC 
ACAGGCGCAT 
TATGTTGGCC 
AAGTGCTGGG 
CCATTTGAAC 
TTTAATTTCT 
TCTCAATTTG 



CCGTCTCTAC 
CTACTCGGGA 
CGAGACCTTG 
GAAGCAGTGG 
TGAGGTCAGG 
CAATAAATTA 
ACAGGAGAAT 
CACTCCATCT 
GTTATGGTAC 
TTAATTGATG 
ATAGCTGATT 
TACTTTCAAT 
CGTCAAATGA 
TATTTGGCCT 
TTTGTTTTAT 
CATTTGGAAA 
AGTTTTTGAA 
TGTCTTGCAA 
TTATTAGTAT 
CGGTTAGACT 
GGATGACGAA 
GATAAAATGA 
CCAACCTTCC 
CACTTTTACA 
AGAGACCTCT 
CCAAAGTAGA 
ACATTAGTAA 
CGGTAATAAT 
ATTG CATAAA 
AACTATGAGT 
TTTTATTTAT 
GCAATGGCGT 
CCTCAGCCTC 
TGTATTTTTA 
CTCAGTGATC 
GCCCGGCGCA 
CATATTATAC 
GTAGTAAGGA 
AATTACATCA 
GTTTTGGGAA 
TACTCTATAT 
CATTCCCACT 
TGTGTGTGTG 
ATTTGAAGAA 
ATCTTCAAAA 
GTTGTCAGCC 
TCTCACTGTC 
TCCCGGGTTC 
GCCATCGTGC 
AGGCTGATCT 
ATTACAGGTG 
ATGGACATAC 
TCAACCACTT 
ACACCCTTTC 
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TCTCTAAACA 
TACAGATCTG 
AGTTTTGCTA 
CCTCAGAAAC 
AGTGGAAAAA 
AAGGTATCTA 
CTGGTGTGTA 
TAATACTCAC 
AGACACAAGC 
TCTGACCTGT 
CTATTGTAAC 
GACAGGGGAA 
AATCAGTCTC 
CCAGTGAGTC 
GTCCTCTTGT 
TCCAGTTTAT 
CTCAAGCATT 
TAGAATCTTG 
ATTTGTTAGT 
GGGCTGTTAT 
TAATCCCAAA 
TAGGTAAGAT 
CACTGTCCAC 
AGGACACCAC 
CTATATATGA 
GGGTTATCTT 
GGTAGTAAGA 
ATGGAGACAG 
AGACCAACCA 
AGTTTATACT 
CTAAAAGAAG 
GAACTCACTT 
TAACCTTTCT 
AACAGTGGTC 
TTGAATAGAG 
AGATTTATCC 
AGGGGCCCTA 
TATTAAGGTA 
ATTAAAATCA 
AAGAGAAAGT 
ATGTGATCAT 
GGCCGGGCGC 
TCACGAGGTC 
AAATACAAAA 
CTGAGGCAGG 
CACTGCACTC 
TGGCTTCAAG 
GCAGGTCTAG 
GTGAGTGTCT 
TACAAAGTTT 
CAATCCTAAA 
GTTATGAAGA 
TTACCTATAG 
CAAAACTACA 



TGAATGAGTT 
TGGAATATGC 
GTTTTTGATC 
ATAGGGTCCA 
GAAATCTATT 
ATATTAAAAT 
TTTTACACTT 
ATATGGTTAG 
AAATACTTGC 
TCGTGAAACC 
CACCCAACGG 
TTGCAATAAG 
CTTGAGAATT 
GGGAGTGCTG 
G CTAAATC AG 
CTATATGGGT 
GATCTTAGGT 
TAGCTTCCAG 
CCTGCAAAAG 
TGTTTTTGTT 
CTCAGGAATG 
CTCTTTCACT 
TTCACCTCAC 
CATGTCTCCT 
TAGTATGAAA 
ATTGATTAGA 
ATTTGGCGAA 
TGAGATAATT 
CTTCATGTGG 
ATTAATGAAT 
GTAATATTCA 
AACCACTGAA 
CCTTCTGGTA 
ATCCCAGACC 
TGGTAGGATG 
TGAAG CTAAT 
CAAATGTGTT 
CTTTAATCAC 
CCATTGTCTG 
TTAGTTGAAG 
GTGTACTTCA 
GGTGGCTCAC 
AGGAGATCGA 
AATTAGCCGG 
AGAATGGCAT 
CAACCTGGGA 
GAATGTTCCT 
ATAAAATGTT 
AGTGGAGAGT 
ACAACTTACA 
AACTTACTTG 
AAACATATTA 
ACAACATTAC 
GGAAAATATA 



C CAATCAT AT 
CAAAAGTTAA 
TGTGAGTGAA 
CATATGTAAT 
TTAATGATTT 
ATTGAGTTTT 
AAAGCACATC 
TGGCAACTAT 
TCTGCAGCAG 
CAGGTAGTGT 
GCTCTCCTTG 
GAGCCAGCGC 
TGGGGACCAA 
CTTGGTTGGG 
TTCCTGGGAG 
GGTGCCAGCT 
TTTAAAATAG 
CTGCATGACT 
CAGTCTGGTC 
TAAAAGCAAA 
AAAAGGACAG 
GTAATAATTT 
ATCAGGCCTC 
TATCCACCCT 
CTATATATGA 
AGATATTAAA 
TTTAGTGAAA 
TGCCTTACAA 
TACTTGGCCC 
CCTTTGTTTC 
ATACAAATAA 
GTGTTCAAAT 
TTTCTTCTGA 
AGTAATTCTC 
CTGAAGAAGG 
GAAACACAAG 
AGTTTGTCTC 
GGATGGTTCA 
ATTATGTTAG 
ATGTATCTAG 
TTCGTTGCCA 
GCCTGTAATC 
GACCATCTTG 
GCGTGTTGGC 
GAACCTGGGA 
GACACAGCGA 
ACTGCTCACT 
ATGACATCTA 
AGAAACGTAT 
TGTGAAAGGA 
ACATTACCAA 
TCATCAGCCA 
AAAATAATTT 
CTTGGTAGTG 



TTATTCCTAA 
GGTGAAAAAT 
TATAACTATC 
TTTAAATTTT 
GAATCCAGTG 
TACTTTGTTA 
ACAGTTTGGA 
CTTGGACAGG 
AATCCAGATG 
CTCTAATACT 
TCCACTTCCT 
TACAGGAGAC 
AGTTTTTAAG 
TCAGAGATGA 
TGGTGGGGTG 
AATCCATTGT 
TGATTTTATC 
CCTAAACCAT 
CCCAGGCAGG 
AGTATAAACT 
CTTGGAGGTT 
TCTCAGTTAT 
TGACTAGAGG 
GAGGGATTCC 
GAAGGAAATT 
GTGTGACACT 
TTCCTGAGGC 
TGCTGAAGTA 
GTGGAAGACT 
ATTGTTATTT 
AGTTAAAACA 
TGCTTAAGGT 
GAACAGCACC 
AACTCACAGG 
CCACGTAAAA 
TGTAAGGGCC 
TCTCTCTCTC 
GGCTGCTATT 
AATCCTGATG 
TATGGGGATA 
GCCAATCTGA 
CTAGCACTTT 
GCTAACACGG 
GGGCGCCTGT 
GGCGGAGCTT 
GACTCCGTCT 
GGAATAACTC 
AGTATT C AAA 
AGAGCCAGAA 
GCTTAACAGA 
TAATGTGTTT 
CCCTGGAGGA 
CGATCTGAAG 
TCATATTCAG 



GCTATCACAC 
TAAATTATTA 
CTCTATGTCC 
TTAATAGGCA 
TAACCAAAAA 
TTTTACTAGG 
GTAGCCACAT 
ACAGCTTTTA 
TTTTCCAAGA 
TTATATTTTA 
AGACAGAGCT 
TAGAGTTTTA 
GATAATTTGA 
AATTATAGGG 
GGGGACTCAA 
GTTCAGGGTC 
CCCAGGAGCA 
AATTTATAAT 
AAAGGGGTTT 
AAGCTCCTCC 
AGACGTTAGA 
GATTTTTGCA 
ATTCCAACAA 
AATTTCTGAA 
ATATATGATA 
GCCTGGCAAT 
TGAACCTCCA 
AGAATTTTAC 
ATCAATGACA 
CCTTCTACAC 
GCTTGCAGAG 
TG AC TTT AT A 
ACCATCCAAA 
GTGCTCCTGC 
TTTGGCCAGT 
TGTACTTC C A 
TCTGATTTTA 
TTCACTCAAT 
AAAATATTTG 
ATAAGTTACG 
CGTAAGAATG 
GGGAGGCCGA 
TGAAACCCCG 
AGTCCCAGCT 
GCAGTGAGCC 
CAAAAAAAAA 
AC CT AAATTC 
ACACATTCCC 
GCTAGTCTGG 
GGATTTTCCA 
TGAAACTGAA 
AAGATTGAAT 
ATGGAATCAG 
AAGTTAATAA 



TCAAATATAC 
GGTATTTCAT 
TGGCACTGTT 
CATTTTAAAA 
TTGTTTCAAC 
TCTTTGAAAT 
TTCCAATGCT 
TACTCTGGGA 
AAACACTTTT 
TTGGTTTGTC 
GATTTATCAA 
TTATTACTCA 
TTGTAGGGGA 
AGCCTAAGCT 
GACCAGATAA 
TGCAAAATAG 
ATTTGAGGTT 
CTTGTGGCTA 
GTTTCTGAAA 
CAAAGTTAGT 
TGGAGTCGGT 
AAGGCAGTTT 
TACTTAGGCC 
ACAAAGGAAA 
ATCAATTTTA 
GATATCTGCT 
CTTCTGTAAA 
ACAATAATTC 
GTTAGTTTAT 
GTTGGCCTCT 
TTGTCCCAGG 
TTCTCCTGAC 
GCATCATGCA 
AGAGATGTAT 
GATCTGGGGC 
AGGTGCAGAG 
AAATTTGCAG 
CCTCCTTTTT 
GAATTTGAGT 
TGATTTGCAT 
GCTTCAAGGA 
GACGGGCGGA 
TTTCTACTAA 
ACTTGGGAGG 
GAGATCGCGC 
AAAAAAAGAA 
CTGGCAAGAT 
AGCACTGAGA 
AAAGAATTCT 
AATTTGAAAA 
ATACTTCTAA 
TCTATTTCCA 
AGTATTCAGT 
AATATGCTAT 
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87481 TTTCTGAATT TTGTGATGGC TGTTGTTTTG TCAGCTTTTA TAAAATTGGA ATTTGATTTT 

87541 ATTTTCCCAT TATAAATTTA TATTTACAGT CTGCAGTACT TTTGCATTTT TAATTTTACA 

87601 TTATAGCTTT TAATAGTTAA CAAGTTGTAA AAGGTTTGAT CCCCAGAAAA CCTTGATCTA 

8 7661 CCCCCTCAGT TAAGTATACT AATATATTTA GAAAATGGAT GAAATCAGCA TTTGAATATT 

87721 TTTAAATATT TATTAAAAGA GGACATGGGT AAAAGAGCTT TGCAGTTGCC ACCCTTCATT 

87781 CTCAAATTCC CTGGATAAGG ATG AC CGCAT AATCTTTGGA TGGTCATACG CAAGTCTTGT 

8 7841 GTATTTGTTA CATAAATCTA TTTAGTGGAC TTTTGGCAGT GTGTACTGAG GCCAGTTTCT 

8 7901 TCCACCTGAG CTCTGACTCC ACCTCCAGCA GCCCAAAACC AATACTGAAT TTTGGGGTCA 

87 961 GCTATTGTTT TTGTGGACTT AGGTAACTAC ACACACATTG TCTTTATGAT AG CTTTAATA 
88021 ATACTGCCAT CAGAACTAAA ATTGTCACGT GGATTAAAAG GAGTGACGGT GGTGTCCCCA 
8 8081 GGAGCCTTTC AATATGTAAG TATTTACACA TATACATGCT AAAAAGACCC CTAGGAATTT 
88141 TTTTAACAAG GGCAAAACAG TAACTCAGCT TGTTTTCTCG CAGTAAAACC GGTTGAAAAG 
8 8201 GCCTGATAGA CTTGTCTGCA GTTACAAAAC TTGTGTGTAG TTATCACCTT TATATCTCCT 
88261 GGAAACTAAC ATAGACAACC GAATGGGTTA CAACTGTTTT TAAGTGAAAT TGTGAGTGGC 
883 21 TCTGAAAAGA GCCTTTTCAA TGAGGAAGAA ACGGGCAGAC TTATGCCCTT TCCCCACGGA 
8 8381 TGCGACGTGC CAGCTGGATA TCTTTGGGCA TGATGGTGAC GCGTTTAGCG TGAATAGCGC 
8 8441 ACAGATTGGT GTCTTCGAAG AGTCCCACCA GGTAGGCCTC GCAAGCCTCC TGCAGCGCCA 
8 85 01 TCACCGCAGA GCTCTGGAAA CGCAGGTCGG TTTTGAAGTC CTGGGCGATT TCTCGCACCA 
8 8561 GGCGCTGGAA CGGCAGCTTC CGGATCAGCA GCTCGGTGGA CTTCTGGTAG CGACGGATTT 
8 8621 CGCGCAAGGC CACGGTGCCC GGGCGGTAGC GATGAGGTTT CTTCACGCCA CCGGTGGCCG 
8 8681 GAGCGCTCTT ACGGGCTGCT TTAGTAGCAA GCTGCTTGCG CGGAGCTTTG CCG CCGGT AG 
8 8 741 ACTTGCGAGC TGTTTGCTTC GTACGAGCCA TTTGCAATGA GAGCACACAC AAAAGTGTAG 
88801 TGAACTGAGA GCAAGTGGCC TTTAAATATA GTGAGAAACA TTCTGATTGG TCCTGTAATA 
88861 TTTCAAAAGT CCCGCGCGAT AAAATCATTG GCTGAAGAGT GACCAGACTG ATTGGTTCAT 
8 8 921 TACTAGACAA TCTTATTGGA TGAGTTGCCC CACCGCCCAT CCTGTCCTTT TCGTTTCAGT 

88 9 81 TATCTGCAGC GACAAATTGT CTAAAATTCT AGTTCATCCA GTCCCAAAGA ACAGAGTGTA 
8 9041 TAACAAGGTA TCTAAGGATT TTTAAAATGT AAATTCCGAT TCAGTAAGTT TGAGTGGGAC 
8 9101 TTGAAATTCT GCATTCCTGA CAGTCTCGCA AGTTATCAAT GCTGGTGAAC ACTCACTAAA 
8 9161 C C AC C AG AAA CGTTCAGACT CATGTCGGGA AATAACGCTT ATATTCAGAG AATGAGATTC 
8 9221 CATGCTATTT TGTTACTGGC GAACAGCAAG TTTCCTTGCC CTTTGTTTTC TAAGTCCAAG 
8 9281 TCACATTCCC ACCCTGCCTG TTCTCAAAAT GTCTTATTTT GGTTGGCCTT AAGTTTCACT 
8 9341 TTGTATACTC TAAAATGTAC TTTCTAAAGG AAGGTGTTAT TTTCTCGAAA CTTAACTTTT 
8 9401 TAACACCATT AGGCTAGGGG GGCGGTGGCT CACGCCTGTA ATCCCAGCAT TTTGGGAGGG 
8 9461 CGAGATGGGA CGATCACTAG AGGC CAGG AG TTCAAGACAA CCCTGGCTAA AATGGTGAAA 
8 9521 CCCCGTCTCG CATAAAAATA CAAAAACTAG CTGGGCGCGG TAG C AG ACG C CTGTAATCCC 
8 9581 AAGTACACAG GAGGCTGTGG C ATG AG AAC C GCGTGAAGCG GCGGGGT3GA GGTTGCAGTA 
8 9641 AGCCGATATC GCGCCGCTGC ACTCCAGCCT GGGTGACAGA GCTAGACTGT CTCAAAACAA 
8 97 01 ACCAATCCAA ACGAAAAGCA AAAAATACCC TAACAGAAGC AAGTTATCAT CCTTTCTTGT 
8 9761 GTAACTATGG ACGGCTCTGA AAAATGCCGT TTCAAGTGTA AG CTACGTTT TCTGATTTGA 
8 9821 GTGTTTACTT GACCTTGGCC TTATCGTGGC TCTGTTATTT TGGCAACAGG ACGGCCTGAA 
8 9881 TATTGGACAG GACGCCTCCC TGAGCAATAG TGACGTTGCC CAGCTGCTTG TTGACCTCCT 

8 9941 CGTCGTTTCG GATGGCCAGC TGCAGGTGGC GGGGGATGAT GCTGCGGGTC TTGTCACGTA 

9 0001 TGGCGCTGCC CACCAGTTCT AAGATCTCGG CGGCCAGGTA CTGTAAGTAC ACTGGCGCAC 
90061 CGGCTCCGAC CGGCTCAAAA TAATTGCCCT TTCGAAAAAG ATGACGGACT CTGCCCTATT 
9 0121 GGGAACTGCA AGCCCGGTAG CGACGAACAA GTTTTTGCTT TAGCTCCATT TTCCACGTCC 
90181 GCAAATAGCG ACCTATGAAA GCAGCGGAAA ACTGTGAAAG ACAAGCAAGC TGGAATGGCG 
90241 CCTGAACAAA TCCTTTTATA CAAACTGCAA GGCTGCAATA GGAAGCTATC CTATTGGTCA 
90301 ATTATGTTTG GTGCTTTATC CAATAGAAAA AGATAACATA AATTCCATAT TTG CATAAAC 
90361 CCCACCCCTC AGTGAAACCG TGTTTCTTTT GTCCAATCAG AAGTGAGGAA TCTTAAACCG 
90421 TCATTTGAAT CTCAGGACTA TAAATACATG GGCTCTGAAC TGTTCTCTGT ACTACTCTGT 
9 0481 AGTGGAGAGT GTTAGTAGCT TTTCTATTCT GTTTAGGAAT AGCAATGCCT GAAC CCTCTA 
90541 AGTCTGCTCC AGCCCCTAAA AAGGGTTCTA AGAAGGCTAT CACTAAGGCG CAGAAGAAGG 
90601 ATGGTAAGAA GCGTAAGCGC AGCCGCAAGG AGAGCTATTC TATCTATGTG TACAAGGTTC 
90661 TGAAGCAGGT CCACCCCGAC ACCGGCATCT CATCCAAGGC CATGGGGATC ATGAATTCCT 
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90721 


TCGTCAACGA 


90781 


AGCGCTCGAC 


90841 


AGCTGGCTAA 


90901 


AATAAGTGCT 


90961 


TCACAAGGAG 


91021 


T C TAG AGGAT 


91081 


GGGGCCGTGC 


91141 


GATCACTTGA 


91201 


C T AAAAT AC A 


91261 


CTTTGTTCCC 


91321 


CTCCCTCCCT 


91381 


TTCAGGGTAT 


91441 


AGGGGAGCAG 


91501 


GCCTATGCTG 


91561 


CGGGCGCCTT 


91621 


TATAATTGTT 


91681 


CCAGATTCCC 


91741 


GGGTTTGGAT 


91801 


CACTTGGAGA 


91861 


AAAAAAAAAA 


91921 


TCTGGGGCTG 


91981 


AAGCAGGAGG 


92041 


TGTAGGTGAC 


92101 


TTTTTGTTTT 


92161 


CTCGACTTCC 


92221 


GAAATGCACC 


92281 


TTGTTGCCCA 


92341 


CCAAAGTCCT 


92401 


ATCTTTTAAA 


92461 


TTGGGAGGCC 


92521 


TTAGTGAGAC 


92581 


TTCTGTAGTC 


92641 


GAGGTTGCAG 


92701 


TCTCAAAAAT 


92761 


TAAGTGAAGC 


92821 


TTGGCTTAAA 


92881 


TACTAAGGAA 


92941 


TGAGACCTGG 


93001 


GCCTGTAATC 


93061 


AGACCAGCCT 


93121 


GCATATGCCT 


93181 


GAGGCAGAGG 


93241 


ATGCACCCAC 


93301 


GGTGGCTCAC 


93361 


AGGAGATGGA 


93421 


TAATTAGCTG 


93481 


GAGAATGGCG 


93541 


CCAGCCTGGG 


93601 


AAATATGAAG 


93661 


TGCCTGCCTT 


93721 


AGGGTTTCTG 


93781 


CCCATCCAGA 


93841 


CCGCAGTGCA 


93901 


CCATAGACAG 



CATCTTCGAG 
CATCACCTCC 
GCATGCTGTG 
TATGT AAG C A 
AGCTATAACC 
CAACTGGAAT 
GCGGTGCCTC 
GGTCGGGAGT 
AATGATAGAC 
CCTGGGTAAG 
GGTCTAGTAC 
AGTGTTCCTG 
AAAAGTCTAA 
TAAATTCTTA 
TATACGGAAT 
TATCAATGAC 
ATTTCCTAAG 
TTTGTGCCCT 
GGGAAATCTT 
AAAAAAGGCA 
GGGCTGGGGG 
GGGTGGGGGA 
ATACAGCAGT 
GAGAAAGGGC 
CCAGCTCAAG 
ACCATACCCA 
GGCTGGTCAA 
GGGATTATAG 
AGAGGTTCTG 
AAGGTGGGAG 
CTTTTGTCTC 
CCAAGTACTG 
TAAGCTGTGA 
AAAAAATAAA 
ACTTCCCATC 
AATCTACATT 
TTGAGGCTGC 
TAATATAAGC 
CCAGCACTTT 
GAGCAACATG 
ATAGTTC C AG 
TTGCAGCAAG 
GCCCTAAAAA 
GCCTGTAATC 
GACCATCCTG 
GGCGTGATGG 
TGAACGCGGG 
TGACAGAGCG 
TTTTGAAGCA 
CTTCCTTTGT 
TACTATAGTC 
CCCCAAGAGA 
AAGTAAATGC 
AGCAGGACAT 



CGCATCGCGG 
AGGGAGATTC 
TCCGAGGGCA 
CTTCCAAACC 
ACAATTTCTT 
GTTAGCGAAG 
TTGCCTTTAA 
TCGAGACTAG 
GGTCGTGATG 
CCTTCGGGTA 
AGGAAACTTC 
TGGGGGTCAT 
GCGACAAAAG 
CTTCAAGTAT 
ATTTCCCGCT 
AACAGCTATG 
CCACTTAACG 
TCCCCATCTG 
TTTCGAGAAG 
GGAAGAGCAC 
AAG AAATG C A 
ATCGGAGGGG 
GTCTTTGGAT 
CTTTCTCTGT 
CGATCCTCTT 
GTTAATTTTT 
GCGAACTCCT 
GAATGAGTCA 
GGCCGGGTGT 
GATCACTTGA 
CACCAAAAAT 
GGGAGGCTGA 
CGGCACAACT 
AAAAAATCTG 
CTAGTACTGT 
CTTTTTTTAA 
AGTTTAAGAA 
ATTTTCAAAA 
GGGAGACCTA 
GCGAAATCCA 
CTACTATAGA 
CCAAGATCGC 
AAAGCATGAC 
CCAGCACTTT 
CTTAACACGA 
TGGGCGCCTG 
AGGCGGAGCT 
AGACTCCGTC 
GAAATTATTT 
T AC AG AACT C 
CCTTCTGTGG 
GGGTTCTTGG 
AAGTTTACTA 
TCCCGAAAGT 



GCGAGGCTTC 
AGACGGCTGT 
CTAAGGCAGT 
CAAAGGCTCT 
AAGGTGGTGC 
ACAAGTTTTA 
TCCCGGCAAT 
CCCGGCCAAC 
GCGCTCTTTC 
CTATGTATAA 
CCTTTCTGGA 
TAGCCGTTAA 
GGCATGTAGG 
TGAGGAAACA 
CCACAAAATG 
TAGTTTACAT 
TTCTGATTTC 
GCGCCACTGC 
TCCAGGACGC 
TAGTTGAGGA 
AGAAGAAAAG 
AGTATTTTCA 
GAAGAAATAA 
CGGCCAGGCG 
ACTTCAGCCC 
TAATTTTTTG 
GGGCTCAAAT 
CCGCGCCCGG 
GGTGCAGCTC 
GCCCAGGAGC 
TTAAAAAATT 
AGTGGGAGGA 
GCACTCCAGT 
GATGC CAC AC 
ATATGCAAAC 
TTATAAAACT 
GCTGATATTT 
TGAACTTTTG 
GTCAGGCAGA 
GTCTCTACAA 
GGCTGAGGTG 
GCCGCCACAG 
TCATTAAAAA 
GGGAGGCCGA 
TGAAACCCCG 
TAGTCCCAGC 
TGCAGTGAGC 
TCAAAAAAAA 
TGTCGTATGT 
CAACACTTAC 
TGGCCAGAAA 
ATCCCGCGCA 
AGAAAGTAAA 
AAG AG G AGG A 



TCGCCTGGCT 
GCGCCTGCTG 
TACCAAGTAC 
TTTCAGAGCC 
TGCTGCTATT 
GAGCCAAGGT 
TTGGGAGGCC 
ATGGCGAAAG 
TCATCTGTCT 
TTCCTTTGAT 
TAATGAAGCA 
CTTCTTGTGA 
GATATTTGCT 
ATAAGCGAAG 
AAATCGCAGT 
ATTTCATGCA 
CAGCTCTGCG 
AAAGCTTACT 
CAAAAACAAT 
GGAGGACTCA 
ACACTTGTTG 
GCGAATTTAT 
AGTTTCTCAA 
CCATCATAGC 
CTTGAGTGGC 
TGGAGGCAAA 
GATCCTCCCG 
CCCAGATTTA 
ACGCCTGTAA 
TCAAGACCAG 
AACCAGGCCT 
TCATTTGAGC 
CTGGGTGAGG 
AAAATGTCAG 
TGCCGTTGTG 
ACCACATCCC 
AGGATCTATC 
GGCCAGGTGA 
TCACTTGAGC 
AAAATTAGCA 
GGAGGATTAC 
CCTGAGCGAC 
AAAAAAATTT 
GGCGGGCGGA 
TCTCTACTAA 
TACTCGGGAG 
CGAGATCGCG 
AAAAAAAAAA 
TCTTTCATAA 
CCAAAGGTAG 
TATGTTACAG 
AGAAAGAGTT 
GTGGTGAAAC 
AGGCATCCAC 



CACTACAATA 
CTGCCTGGGG 
ACTAGCTCTA 
ACCTACTTTG 
CTGTTTCAGT 
TAACTTGGAC 
GAGGCGGGCG 
CCCGTCTCTA 
TAGCAAACTT 
AAGGTCACTA 
GGTAATGGAA 
GATGCGGGGG 
CCTGCAGCTT 
TCTGATTTCC 
AGTTTTGAGT 
TCCCAGAAAT 
AGATACAAAA 
AGGAGGGCCC 
ATAGCTAAAA 
ATGGGCCAAT 
ACTGCACAGT 
GGGCATTATA 
ACAGTTCTTG 
TCACTGCAAC 
TGGGACTAGA 
GGGTCTTACT 
CCTTGGCCTC 
ATTTTTAAGA 
TACCAGCATT 
TCTGGGCAAC 
GGTGGCACAT 
CTGGAAGGTG 
ACAGACCCTG 
TGAACAACTG 
AAAGTGACGC 
C C AAAAAC AT 
TCCGGAGAAG 
GGTGTGTCAT 
TCACAATTCG 
GGGCGTGGTG 
TTGAGCCCGG 
AGAATGAGAT 
AGCCGGTCGC 
TCACGAGGTC 
AAATACAAAA 
GCTGAGGCAG 
CCACGGCACT 
AAAATTAAAA 
ATTTTTTGCC 
CTGTTGGGTC 
GAAAGAGGTC 
CAGGGTGAGT 
GACAACTACT 
CCTAGGTACA 
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ATACTTGTAT 

TTAGTTACTA 

TTGTTCTCCA 

TATTTATTTG 

ATGCAGCCCA 

GGTTCAAGTA 

TTTATTTTTT 

CCCATAATTG 

GCTGTCTCTG 

GTCCCCCTCA 

CTATGCATTT 

CATTTCCGAA 

CTTGCTAATC 

TGCAATGGCG 

GCCTCGCCCT 

GTATTTTTAG 

TCGTGATCCG 

CCGACCAATC 

AACAGAATTT 

TTTTGGTGAC 

AATCTTCTAT 

GATTCTAATG 

ATTCACTGAC 

ATTTCTCTAC 

TGATATACAT 

CACTCTCACC 

GTTAGTCTGT 

GACAATTATT 

TACTCATGAT 

GATAGCTCCA 

TCTAAGATTT 

GTGCAATGGT 

TGCCTCAGCC 

GTATTTTTAT 

TTGGCTTAAA 

TACTAAGGAA 

TGAGACCTGG 

GCCTGTAATC 

AGACCAGCCT 

GCATATGCCT 

GAGGCAGAGG 

ATGCACCCAC 

GGTGGCTCAC 

AGGAGATGGA 

TAATTAGCTG 

GAGAATGGCG 

CCAGCCTGGG 

AAATATGAAG 

TGCCTGCCTT 

AGGGTTTCTG 

CCCATCCAGA 

CCGCAGTGCA 

CCATAGACAG 

ATACTTGTAT 



ATATGGGGAG 
TATTTTGCAA 
GATATAGGGA 
TTCCCTTAAC 
GAAAGTCTCA 
ACTCTGACAC 
ATTTTTGAAA 
ATAAG CCAAA 
CTGATACTCG 
GTTTATTACC 
CACAAAACTT 
GGGTCCCATG 
TTTTTTTTTG 
CGATCTCGGC 
CCCGAGTAGC 
TAGAGACAGG 
CCCGCCTCGT 
TGTCTTTTTG 
TCTTTTCCCC 
CAATCTTACA 
AAGTGAGATT 
ATTATTTTCA 
CTTCGCTTTT 
ACACAAGATT 
ATTTTGATTT 
AACAGGGTGT 
TCAAATTGCC 
GTTTGAGACT 
TCTTGCCCAT 
TGTATTAAAA 

GCGATCTCGG 
TCCCCAGTAA 
TAGAGATGAG 
AATCTACATT 
TTGAGGCTGC 
TAATATAAGC 
CCAGCACTTT 
GAGCAACATG 
ATAGTTC C AG 
TTGCAGCAAG 
GCCCTAAAAA 
GCCTGTAATC 
GACCATCCTG 
GGCGTGATGG 
TGAACGCGGG 
TGACAGAGCG 
TTTTGAAGCA 
CTTCCTTTGT 
TACTATAGTC 
CCCCAAGAGA 
AAGTAAATG C 
AG CAGGAC AT 
ATATGGGGAG 



ATGTGCTCTG 

GAATCAACAT 

TATCTGGACA 

CGTAAACATC 

GCCTCATTTT 

TTTTCTTCTC 

TAAGAAATCA 

ACAAAAACCT 

GCTGATCGTT 

ATTAGATCAT 

GCCATAAAAA 

TAATATAAAA 

TTTTTTGAGA 

TCACTGCAAC 

TGGGACCACA 

GTTTCACCGT 

CCTGCCAAAG 

TAGAGGGGCC 

TACAATATAA 

GAAATTTTAT 

GTATTTCACT 

TTACTGCATT 

TAAAAATTTA 

GCTGTAAGGG 

TTAATACATG 

TTTTTCCTGA 
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CTTAACACGA 
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AGGCGGAGCT 

AGACTCCGTC 

GAAATTATTT 

TACAGAACTC 

CCTTCTGTGG 

GGGTTCTTGG 

AAGTTTACTA 

TCCCGAAAGT 

ATGTGCTCTG 



CTACAAGTTT 

TATTATCTTT 

CTCCTAAGTC 

TAGAAGCTAG 

CCTAGCCCTC 

TTTTTTTCTT 

AGAATACTTG 

AGGTCTTCTA 

AATAGGTAAT 

ATGCCTACTG 

TTCACAGGTT 

CTTATATTAA 

CTGAGCCTTG 

CTCCGCTTCC 

GATACGTGCC 

GTTGGCCAGG 

TGCTCGGATT 

TCAAG CATGA 

ACATTAATTG 

CTTGTGCAAG 

TTTCTAGTAT 

TCATTGTAGG 

AACCATGTTA 

CAAAAATAGA 

TTACCAAGTT 

CTTCCACAAA 

ATTAAATCTC 

ATAATAACAT 

GATGTTGCCT 

TTTGAGGGCT 

CGGAGTTTCA 

CCTCCGCCTC 

TGGCAAGCGC 

GTTGGTCAGA 

TTATAAAACT 

GCTGATATTT 

TGAACTTTTG 

GTCAGGCAGA 

GTCTCTACAA 

GGCTGAGGTG 

GCCGCCACAG 

TCATTAAAAA 

GGGAGGCCGA 

TGAAACCCCG 

TAGTCCCAGC 

TGCAGTGAGC 

TCAAAAAAAA 

TGTCGTATGT 

CAACACTTAC 

TGGCCAGAAA 

ATCCCGCGCA 

AGAAAGTAAA 

AAGAGGAGGA 

CTACAAGTTT 



GTGATAAAGG 

AAACAAAATT 

TGAGTCTGTT 

GAATGACTGA 

ACTCAAAATG 

CTTTTTTCCT 

ATGTTTCATC 

ACTCAAAACT 

TAACAAACAA 

TCAATCATAT 

TCCCGCTTCC 

ATACATTTGT 

CTCTGTCACC 

CAGGTTCAAG 

ACCATGCCCC 

ATGTTCTCAA 

ACAGACGTGA 

ACTTACTGAT 

TAATGTTATC 

TCTATGCAAA 

CCTTTTAAAT 

GAAGTAGATA 

CCATGAAAAT 

GATAGGAATC 

GCCTCCTGAA 

TGCTCTTGAA 

ATTGTTGTTT 

TTCTTCTATT 

TATGTACATT 

TATGATATGT 

CACTTGTTGC 

CAGGGTTCAA 

CACCACGCCT 

CTGGTCTCGA 

ACCACATCCC 

AGGATCTATC 

GGCCAGGTGA 

TCACTTGAGC 

AAAATTAGCA 

GGAGGATTAC 

CCTGAGCGAC 

AAAAAAATTT 

GGCGGGCGGA 

TCTCTACTAA 

TACTCGGGAG 

CGAGATCGCG 

AAAAAAAAAA 

TCTTTCATAA 

C C AAAGGTAG 

TATGTTACAG 

AGAAAGAGTT 

GTGGTGAAAC 

AGGCATCCAC 

GTGATAAAGG 



ATTAATTTTC 

AAGAATGCCT 

TAGTAAACAT 

CTTTCTGGGA 

GAGTTACTCT 

TCCTTTATTT 

TAAAACAATA 

AGGATGTTTT 

GCCTTGCTAT 

TAATCCACAA 

CTCGAGTTTT 

ATGCTTTTCT 

CAGGCTGGAG 

CGATTCTACT 

GCTAATTTTT 

TCTCCTTACC 

GCCACTGCAC 

GGGTGAGAAA 

ATTCAGGACA 

CCAATATGTA 

TAATAAAAGA 

ATTGCCCTTT 

GCTTTTCAGT 

ATGCATCCAT 

GGTCTGTTTA 

CAGTGGGTGT 

TTATTTTTAA 

ATGGTTTGAT 

ATTTTAAATA 

CAGTTACATT 

CCAGGCTGGA 

GCAATTCTCC 

GGCTAATTTT 

ACTGCCGACC 

C C AAAAAC AT 

TCCGGAGAAG 

GGTGTGTCAT 

TCACAATTCG 

GGGCGTGGTG 

TTGAGCCCGG 

AGAATGAGAT 

AGCCGGTCGC 

TCACGAGGTC 

AAATACAAAA 

GCTGAGGCAG 

CCACGGCACT 

AAAATTAAAA 

ATTTTTTGCC 

CTGTTGGGTC 

GAAAGAGGTC 

CAGGGTGAGT 

GACAACTACT 

C CTAGGTAC A 

ATTAATTTTC 
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TTAGTTACTA 
TTGTTCTCCA 
TATTTATTTG 
ATGCAGCCCA 
GGTTCAAGTA 
TTTATTTTTT 
CCCATAATTG 
GCTGTCTCTG 
GTCCCCCTCA 
CTATGCATTT 
CATTTCCGAA 
CTTGCTAATC 
TGCAATGGCG 
GCCTCGCCCT 
GTATTTTTAG 
TCGTGATCCG 
CCGACCAATC 
AACAGAATTT 
TTTTGGTGAC 
AATCTTCTAT 
GATTCTAATG 
ATTCACTGAC 
ATTTCTCTAC 
TGATATACAT 
CACTCTCACC 
GTTAGTCTGT 
GACAATTATT 
TACTCATGAT 
GATAGCTCCA 
TCTAAGATTT 
GTGCAATGGT 
TGCCTCAGCC 
GTATTTTTAT 
TCAGGTGATC 
GCCCGGCCAC 
AATGAAGTGT 
CTTACTGAGT 
AAGGGGTGTT 
ATCAGGTTTT 
GCTTGTTTTT 
GTAATTTTCT 
GGCTAAAGAG 
GGTCCCAGAA 
TATGCCTCAT 
ACAGCTATAT 
TACCTCCCAT 
TACATCACAA 
AATGGCTTAA 
CCAATCAGAG 
CTATAGTTAA 
GCCATGCCAG 
AGTCTGTCTT 
ATACAGATAC 
TAATTTCAAC 



TATTTTGCAA 
GATATAGGGA 
TTCCCTTAAC 
GAAAGTCTCA 
ACTCTGACAC 
ATTTTTGAAA 
ATAAGCCAAA 
CTGATACTCG 
GTTTATTACC 
CACAAAACTT 
GGGTCCCATG 
TTTTTTTTTG 
CGATCTCGGC 
CCCGAGTAGC 
TAGAGACAGG 
CCCGCCTCGT 
TGTCTTTTTG 
TCTTTTCCCC 
CAATCTTACA 
AAGTGAGATT 
ATTATTTTCA 
CTTCGCTTTT 
ACACAAGATT 
ATTTTGATTT 
AACAGGGTGT 
TCAAATTGCC 
GTTTGAGACT 
TCTTGCCCAT 
TGTATTAAAA 

GCGATCTCGG 
TCCCCAGTAA 
TAGAGATGAG 
CACCCGCCTC 
ATTTCTAAAT 
GAGTACTATT 
TCTATACACC 
GAAATCTCTA 
G CTAC AC ATA 
CGTATTGGAT 
TTGCTCTGAA 
TAGAAAGGAG 
CCTGCCAAAA 
AGGCTATATA 
ATATTATGAG 
GTTCACTTCG 
GGTGAACTAT 
GGTCTACAAT 
TTGTAGTGGA 
GGAATTTAGC 
CCAAGCCATG 
AGTTGGTATA 
TCTTGCAGTT 
CTGCGTTATG 



GAATCAACAT 
TATCTGGACA 
CGTAAACATC 
GCCTCATTTT 
TTTTCTTCTC 
TAAGAAATCA 
ACAAAAACCT 
GCTGATCGTT 
ATTAGATCAT 
GCCATAAAAA 
TAATATAAAA 
TTTTTTGAGA 
TCACTGCAAC 
TGGGACCACA 
GTTTCACCGT 
CCTGCCAAAG 
TAGAGGGGCC 
TACAATATAA 
GAAATTTTAT 
GTATTTCACT 
TTACTGCATT 
TAAAAATTTA 
GCTGTAAGGG 
TTAATACATG 
TTTTTCCTGA 
GACATGAACA 
GCACATTTTG 
TTTCTTTTGG 
GATTATTAAG 
TTTTTTGAGA 
CTCACCGCAA 
TTGGGACTAC 
GTTTCTCCAT 
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TCTTTATAAG 
ATTATCATTG 
TGGTAATTTT 
CCTTTAAATC 
TTTTGCAGTT 
TGACCCTGTT 
ATATACTTAT 
AGATTTACTG 
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TTACACAATA 
GGGTGCCAAG 
AGGTTAAATT 
AGGAACAAAG 
TACGTGTAAG 
CTGGACTGTA 
AAGTGTGAGT 
AATGCTCTAC 
GGGGCATCTA 
TTGGGCTGAT 
TTTATATTTG 



TATTATCTTT 
CTCCTAAGTC 
TAGAAGCTAG 
CCTAGCCCTC 

AGAATACTTG 
AGGTCTTCTA 
AATAGGTAAT 
ATGCCTACTG 
TTCACAGGTT 
CTTATATTAA 
CTGAGCCTTG 
CTCCGCTTCC 
GATACGTGCC 
GTTGGCCAGG 
TGCTCGGATT 
TCAAGCATGA 
ACATTAATTG 
CTTGTGCAAG 
TTTCTAGTAT 
TCATTGTAGG 
AACCATGTTA 
CAAAAATAGA 
TTACCAAGTT 
CTTCCACAAA 
ATTAAATCTC 
ATAATAACAT 
GATGTTGCCT 
TTTGAGGGCT 
CGGAGTTTCA 
CCTCCGCCTC 
TGGCAAGCGC 
GTTGGTCAGA 
AGTGCTGGGA 
TATAAATTCA 
TTTTACAGAT 
TTTGTTTCGT 
ATGTATGTGT 
CTGTTATTTG 
ATCATTATGT 
CTGATATATC 
GCAATACTAA 
TTTGGGGAGA 
GAGTCATACA 
TGCATTCACA 
TTGGTTAAAA 
TTTACGTGCT 
AATAGAATGT 
AATCAGAGTT 
TTTTTGGTAG 
CAGTAGGTAA 
TTTTGGTCTT 
GTTTATATGG 
AAGTGAGATT 



AAACAAAATT 
TGAGTCTGTT 
GAATGACTGA 
ACTCAAAATG 
CTTTTTTCCT 
ATGTTTCATC 
ACTCAAAACT 
TAACAAACAA 
TCAATCATAT 
TCCCGCTTCC 
ATACATTTGT 
CTCTGTCACC 
CAGGTTCAAG 
ACCATGCCCC 
ATGTTCTCAA 
ACAGACGTGA 
ACTTACTGAT 
TAATGTTATC 
TCTATGCAAA 
CCTTTTAAAT 
GAAGTAGATA 
CCATGAAAAT 
GATAGGAATC 
GCCTCCTGAA 
TGCTCTTGAA 
ATTGTTGTTT 
TTCTTCTATT 
TATGTACATT 
TATGATATGT 
CACTTGTTGC 
CAGGGTTCAA 
CACCACGCCT 
CTGGTCTCGA 
TTACAGGTAT 
TTCAATCTTC 
CAAAACAAGT 
TGTTCTATCA 
CTATTTCTCC 
GTGCATATAC 
AATATCCCTG 
ATCCAAAAGA 
TTTGCAAGCC 
AGGAGCAGGT 
TATTTAGCAC 
ATGGATAAAC 
TGAGGTAGAA 
GCCTCTAGCA 
GTGTCAAGGC 
AGGAGGGCTT 
TCTTTGGAAT 
CTTTGTTTGC 
TCAGATCCCA 
CTTATCTTTT 
CTTGCAGACA 



AAGAATGCCT 
TAGTAAACAT 
CTTTCTGGGA 
GAGTTACTCT 
TCCTTTATTT 
TAAAACAATA 
AGGATGTTTT 
GCCTTGCTAT 
TAATCCACAA 
CTCGAGTTTT 
ATGCTTTTCT 
CAGGCTGGAG 
CGATTCTACT 
GCTAATTTTT 
TCTCCTTACC 
GCCACTGCAC 
GGGTGAGAAA 
ATTCAGGACA 
CCAATATGTA 
TAATAAAAGA 
ATTGCCCTTT 
GCTTTTCAGT 
ATGCATCCAT 
GGTCTGTTTA 
CAGTGGGTGT 
TTATTTTTAA 
ATGGTTTGAT 
ATTTTAAATA 
CAGTTACATT 
CCAGGCTGGA 
GCAATTCTCC 
GGCTAATTTT 
ACTGCCGACC 
GAGCCACTGG 
ACCAAAACTC 
AATACAGTCA 
ATTATTGGGG 
TTTCGGTTCT 
ATTTAGAATT 
TCTGTTCCTA 
CCACCAGGAT 
AGGAAGAGAT 
TGGTTATTTT 
GTTTGGGGGG 
ACGTGTAATA 
TTTAGGTCTT 
GCTGGCTGAA 
GGTCCTCTGT 
CTGATAGCTC 
TTAGGAATTT 
TTAATCTTAG 
GATATTATTA 
TTGCAGCCTT 
GTGTACAGTT 
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GTTGTTTTTT 
GCACAGTCTC 
CCTCTTGAGC 
TAGTAGAGAC 
TCCGCCAGCC 
AAACTGTTTT 
AGGGCTTAAG 
GCC AG AAATC 
TAACGTGTCT 
GCTTATATAC 
TATGGGAGGC 
CTCAGAAAGA 
TCTCCTGGAT 
TCTCTACAGA 
TGGCCAAGCA 
TGATTTCCTT 
TACCCTTGCT 
CATGCCCTTA 
TAGCCAGTCT 
TCATCTTACA 
TGTTAGAGGT 
TTTGAGTATC 
TGTATTCAAT 
TATGGTTTGG 
AGTGAAGCCT 
TCAGGGTAAT 
GTGACACCTC 
CACCTTCCAC 
ACCTCCTGTA 
TCAGTTTCAG 
ATATTTACAG 
GCTAGTGGGC 
TTTTTTTATT 
GTTGGAGTTG 
CCCCAATATT 
AATTTACTGT 
ACATGCCAGG 
TTCTCATCCA 
TTTCTAGTCT 
ACATATCCAA 
ACTTTCTGAA 
AAGTATTTAT 
ACATTCTCCT 
TATTTTTCTC 
GTGCACCTCC 
TGCTATGCTC 
GCTGAACTAA 
ACATTGATCA 
TTGTATGCAG 
GTATAATCTC 
TACCTTTTCA 
AGATCATTAT 
TCAGAAGACT 
AGTTTCTCTG 



TTTTTTTTGA 
AGCTCACTGC 
AGCTGGGATT 
AGGATTCACC 
TCGGCCTACC 
TTTATGGGTG 
TTCATGAAGG 
ACTGACAAGG 
ATGTGGGAGC 
CATTTTTAGA 
AAAAGAGGTT 
ACAGATGGTA 
CTGGGGAAAG 
TGTAAAATTT 
GCAGCCATTT 
TAGACTGGTG 
CTCTCAACAT 
GCTTCCCAGG 
GTGGTATTCT 
TGACTGATCC 
TCCTCTACCC 
TTCAATAGTA 
ATGCATAATT 
ATATTTGTCC 
GGTGAAAGGT 
CAATGGGTTC 
CCCCATCTCT 
CATGATTGGA 
CAGCCTGCAC 
GGATTCCCTT 
AATAG CTCAA 
ACTGATTTGG 
GTTTTCGCAA 
TTATTGGGAA 
TCCCTCCCCA 
TTTTGAAGCA 
CGCTTGTTGG 
TGGCTCAGTG 
GAGTTTTTGA 
GGCTCTTTCC 
CCACGGTTCC 
CCTTCCTACT 
GATGAAACTT 
CACAGCACTC 
CACTACAAGA 
CCTGCACCTA 
TAATGCTGGA 
ATCTTCTTTT 
AACGTGCACT 
TTCAGGGCAC 
AAGAAAATGA 
AATTTTGAAA 
TGGGAGAAGG 
AATCAAATCC 



GATGGAATTT 
AACCTCCGCC 
GCAGCCATGC 
ATGTTGCCCA 
AAAGTGCTGG 
TATTTATACC 
GTAGTGTGGG 
CAGATTGATT 
ATTCAGAATT 
TCACAGAAAG 
TGGCTTGCAA 
AATGTTTCTT 
GTATAGAAAG 
TTCCCATTTA 
CAAAATATGT 
GCCTTATAAG 
GTTATGATGC 
TTCTAGAACA 
GTTATAGTAT 
CTCCTACATC 
AGTACAAATG 
TATTTTCGTT 
ATTAGTCAGA 
CCTCTAAATC 
TTTTGGATCG 
TCACTTTGAG 
CTCGCTCAGC 
AGTTTCCTGA 
AACCGTGAGC 
ATAGTAATGC 
TCTGAAGTAC 
AGCGTGTTCA 
ACCACGAGGC 
ACAACTTATT 
ATATCTGCCT 
CTTACTGAAA 
TTTGCTTAAT 
GAGTATAGAT 
AGCTACCCTT 
AAAATGGTCT 
TGACATTTTC 
TGGCTGGCTT 
TCCATCCTTA 
ATCACTTATC 
CAAGTAGCAC 
GAACACTCTC 
TATACATCTC 
CCATGTGCTT 
GCTATTTAAT 
TATCTGAGAT 
GCCAGTGATT 
AGGGAAGTTG 
CAAAAAACAA 
ATAGTTCTGT 



CACTCTTGTT 
TCCTGGGTTC 
GCCACCACAC 
GGCTGGTCTC 
GATTACAGGT 
ACACACATTT 
AACCATAGTC 
AATAGGTGAA 
AATTACCTAA 
AATTGGGGCT 
AGGTGGCCTT 
TTATGATTTT 
GTGAGGAGGC 
AGGCAGCTTT 
CAAAGAAATA 
AAAAGGAAGA 
AGTAAGAAGG 
GTAGGAAATA 
CACAAAATGG 
ATACACATAC 
TACTACAAAT 
AACTTTTGTA 
TGTTTTACAT 
TCATGTTGAA 
TGAGGGTGAA 
TTCACAAGAG 
TCTCACCATA 
GGACTTGCCA 
CAAAAAAAAT 
AAGAACGAAC 
CCTTTTTCAA 
AGGGTGAATT 
ATAGATTGTC 
TTCCTCTTAT 
TTTGTATGTT 
GGATTGCCAT 
TCAAGGTAAC 
TACTGATATT 
AATCTTGGTT 
ACGATTTGTT 
TGGACTTCAA 
CTTCCTTGCC 
TTTCTATTCT 
TCTACATTTT 
CGTAAGGAAA 
TGGCACTTAG 
CCTCATGAAC 
TTGTATGATT 
CTTCATGTAC 
AACTTTTTAA 
ACTGATGTTT 
AATATTGTGA 
ACTAAAAATG 
GACAGCGTTG 



GTCCAGGCTG 
AAGGGATTCT 
CCGGCTAATT 
GAACTCCTGA 
GTGAGACCTC 
AATGCAATTA 
TCTTGGCCCA 
AAGGCATTTT 
CTTCCCAATG 
TAGATTCTGG 
GTTAGGTAGG 
TAAGTGTCAG 
ATGGCTGCAT 
GCAAGCCCAT 
TATTTTGGGG 
G AC AC CTG AG 
CCCTCACCAG 
AATTTCTTTT 
ACTAAGTAAC 
ACAGGCCACA 
TATATATGTA 
GTCAAAATGT 
TCTTTCTTCA 
ATGTAATCTC 
CCCCTCATGA 
ATCTGGTTCT 
TGATATGCCT 
GTAGCAGATG 
TACTTTTCTT 
TAACACACTA 
CTTCACAGTA 
GTATTATGCA 
TTACTTTCTC 
ATTTATATGG 
TTTTGAAGGC 
CAAGTTGTTT 
TTGGATGAGA 
GTGACTGGAT 
TCAATTTTAT 
TAGGAAGTTA 
ACACATCCAG 
TTCAGGTCTG 
TTTTTCTTAT 
CATTATGTAT 
CAGGTTGTCT 
CAGGTTTTCA 
TCTCTAAATC 
T ATTG CTCAA 
GTAAGTCCTC 
CATCTCCATC 
ACGGCTATTG 
AGGGAAAGAT 
AGCACTTTTA 
GCTTAGAAGC 



GGGTGCAGTG 
CCTGCCTCAG 
TTTGTATTTT 
CCTCAAGTGA 
GCGCCCAGCC 
TTGATATCTT 
CTAAATGTTT 
ACCTATTGTT 
AGTTATAGAT 
TAAAACAGGT 
TGAAGCCTCC 
ACTCTCAGTC 
T AATG GAG AT 
TTCTGCCTGC 
TAAAATATTT 
CTGACACACA 
ATACTAATTC 
CTTTAAAAGT 
TATATTATGA 
TTTGGAACAT 
TTTTTAAATT 
CATTATAACA 
TACTAAGTGA 
CAATGTTGGA 
AGCGCACTCT 
TTAAAAGAGT 
ACTCCCTCTT 
CCTGCACCAC 
TATAAATTAG 
AGTCTATTTC 
G CTACTTGT A 
ATTAACAGAT 
TGCTCCTGGT 
AATAAATAAC 
AAGTG CCTAG 
TGCTAATAGT 
AGAAGAGTTT 
GTACTCCTGC 
CTAGCCCTGT 
G AATAG CTGT 
CATTTTATCG 
AATTCAAATG 
CCCCTTTCTT 
TTACCTTATT 
GCTTTTTCAC 
GTAAATATAT 
CTTCTAATTT 
AATCTTTATT 
CCTTCTCTGA 
ATGAATCTTG 
TTGAGGGTGA 
AACACTAGAG 
GTCTCCTGAC 
AGATTTTTTT 



Figure 9 (Page 32 of 74) 



SUBSTITUTE 5MPPT mm c oc\ 



WO 98/14466 



PCT7US97/17658 



121/162 



103681 
103741 
103801 
103861 
103921 
103981 
104041 
104101 
104161 
104221 
104281 
104341 
104401 
104461 
104521 
104581 
104641 
104701 
104761 
104821 
104881 
104941 
105001 
105061 
105121 
105181 
105241 
105301 
105361 
105421 
105481 
105541 
105601 
105661 
105721 
105781 
105841 
105901 
105961 
106021 
106081 
106141 
106201 
106261 
106321 
106381 
106441 
106501 
106561 
106621 
106681 
106741 
106801 
106861 



TTTTTTTTTT 
CGGCTCACTG 
TAGCTGGGAT 
ACTGGGGTTT 
CGCCTTGGCC 
AGATTTTTTT 
CACAGATAGA 
TACTCCATCT 
AATCTGTCTT 
TCCCGGCCAA 
AACACAGATT 
TTTAGAATTA 
AGATAAACAG 
AGGCTTGTCT 
AGGACAAGAC 
AGAGTAATAT 
CCTACCTTGA 
GACAGGAAGG 
TTTCTGTGTT 
CAGTGTTGTG 
TTCTTTGAAC 
TAATCAAAAA 
TTCCCTGGGG 
GGTTAACACC 
TATTTTTCTC 
TTATGAATCA 
TCTTCCCTCC 
CGCCCACGCT 
CAAGCGATCC 
CCCGGCTTTT 
GGGTTTCTCC 
AAAGTGCTGG 
TATTTAGAAA 
GCACGAGCGG 
TATTTTGACA 
AG C C AG ACTG 
ATGAAGGATG 
CTCACAGCCT 
AT T AC ATTTT 
AAACAAGGCG 
TTTCCTGTGG 
GCTGGCGCGC 
CTGGCGGGCA 
TTGGCCATCC 
GGTGGCGTTT 
AAGG CCAAGG 
AAAATCAGCC 
TGTTGTGCTT 
AGTGTGGCAC 
CTAGGTATGT 
AAGTTTCACA 
TCCTAACTAT 
C ATAC CATTT 
AACATTTGAG 



TTGAAATGGA 
CAACCTCTGT 
TACAGGCTCC 
C AC C ATGTTG 
TCCCAAAGTG 
ACACTCATGT 
AGTAGTAGAT 
GCTCCTATCT 
GATTTTAGGT 
GGAAAAACTT 
AACTGGAGAA 
AGACTGAAAG 
CTGTATAGGG 
GTCAAGATTC 
TCTCTTTTAG 
TTTTAGGTTT 
GGAGGAATTC 
CAGAAGGTGG 
ATGGAATGTT 
AAAAAGTTCA 
TGAGGGCACC 
TTTGAAAATT 
AATCTCATCA 
ATCTAAACAG 
CAAAATCATA 
AGAGAGCTTA 
CACTCCCCCT 
GGAGTGTGGT 
TCCCACCTCA 

ATGTTGTCCA 
TATTACGGGC 
TTGGTCGGAG 
CTGAAAGTCA 
AAATCCTAAT 
GGGATTGGGT 
CAGATTCTGA 
ACCTCCAGTC 
CTTGTGGCGA 
GTAAAGCTCG 
GCCGAGTGCA 
CGGTGTATCT 
ATGCGGCCCG 
GCAATGACGA 
TGCCTAATAT 
GAAAGTGAAG 
TAACAGCAAA 
TGGATTATGC 
TTTTAGTAAT 
GGGAGAAGTG 
CACAGCAGTT 
CTTGAATGGA 
GCTGTAGCAA 
TATGTATTTC 



GTTTCGCTCT 
CTCCAGGGTT 
CACAACCACG 
GCCAGGCTGG 
TTGGGATTAC 
TTCTTTTTCC 
ACCTCAGAAA 
CATGGAATAT 
TCCTCAACAG 
CCCCTTTGCC 
AAGGCATATA 
ATACAGGGGA 
TACGATCTAA 
TTCTTGACCT 
AATGGGGGGT 
TATGGCTGGT 
TGGTTTCTAT 
TCAGTGAAAC 
TGTTCTCTCA 
GGAAATGCAA 
TAGGAAACAG 
AAAAAAAAAT 
AC C AG AG AAG 
ACTTTGTCAC 
TACTCTCCCC 
TAAGCTTCTA 
CCCCTTTTTT 
GGCTCTATGT 
GCTTCTCGAG 
TTTCTCCCCC 
CGCTGGTCTC 
ATGAGCCACT 
TCCACTCCTT 
AAATAACCAG 
TCGGCCAATT 
CAAACATAAA 
TTTCCCATTG 
AGTATAAATA 
TTTTCCCTTC 
CGCCAAGGCT 
CCGCCTGCTC 
CGCGGCGGTG 
CGACAACAAG 
GGAGCTTAAT 
TCAGGCGGTG 
AGTTAACGCT 
GGCTCTTTTC 
CGCCCATAAA 
TTGTCCTGCA 
CCATGCAGCA 
AC T AC ATTTT 
AGTGTTAAAA 
TTAATGGCAT 
CCAAAATGAG 



TGCCCAGGCT 
CAAGCGATTC 
CCCAG CTAAT 
TTACGAACTC 
AGGCATCAGC 
TTCTGTCATC 
TTCCTGGAAT 
AAAAGGAAAA 
GAGAGCCAGA 
CTCCCAAGGT 
TATTTATTTC 
AATTGCCCAT 
TGCTAACAGA 
CTCAGTGCAG 
CTTATGACCT 
TCTAGGGAAA 
GGCTAGACTT 
ACTTTTATAA 
TTTCCTGAAA 
CTCAAAAATG 
TAAATTCAAG 
TCAAAAAGGA 
ATTAACTGTA 
AGCTGTCACC 
TAAGTTGCCT 
CAGTTCACTG 
TGTCTTTGAG 
GAACTCACTG 
TAACTGGAAC 
GTTTCTTTTT 
GAACGCCTGA 
GCGCCCGATT 
TCCAAAAACA 
AACAAAACCT 
ATTATTAGTA 
CCTTACACCA 
GGTATTTGAC 
CTTCTCTGCC 
TTATCAGAAG 
AAGACTCGGT 
CGCAAAGGCA 
CTTGAGTACC 
AAGACCCGCA 
AAACTTTTGG 
CTGCTGCCTA 
TCATGCACTG 
AGAGCCACCT 
GATGTTTTTG 
GAAATTAGAT 
CAAAACATGT 
AGAGGAAGGA 
CCCGCATGCC 
ACACAATTGA 
CTTTTTTCCA 



GGAGTG CAGT 
TCCTGCTTCA 
TTTTTGTATT 
CTGTTCTCAA 
CACCGTGCCC 
CTGTTTCAGT 
AATTAATCCA 
ACACCAAGAT 
CAATGGCTGT 
TTATGGAAAA 
ATCACAATTT 
TTTTATGCTT 
CTGAGTGGGG 
CATTTCTTCC 
ACAGGCAAAC 
AGGAGTTCTG 
TGGGGAGAAT 
TCATAATCCC 
GATTCCAGAG 
TGCCACTTTG 
GAAGGGCTTT 
ATTTAGTTGT 
TCACAGGAGA 
TATTCTTTGA 
ACATCCCCCT 
GGATTTGGGG 
ACACAGTCTT 
CAACCTCCTC 
TACAGGCGTG 
TGGTTATTTT 
CCCGCCGTCC 
TGAAGGACCT 
TGAGTCACAA 
CCACTCATGC 
TTCAAGTCGA 
GACGGAAGGA 
ATTAGCCAAT 
TTGCGTTCTA 
TAGTTATGTC 
CTTCTCGTGC 
ACTACTCCGA 
TGACCGCCGA 
TCATCCCGCG 
GGCGTGTGAC 
AGAAAACTGA 
CTGTTTTTCT 
ACGACTTCCA 
AGGTGTTTTT 
CCATAGAAAC 
TTACAGGGGT 
AATTATACCC 
CCACACAAGT 
GAGCACACAC 
GTTTGGGGAT 



GGCACGATCT 
GCCTATGGAG 
TTTAGTGAAG 
GTGATCTGCC 
AGCCAGGAGC 
ATAAGCAGAC 
CGTTCATCTG 
TTCCCTAGGC 
AATAATATTG 
TTACTGGCAA 
TACAGGAGAT 
AGGTTCAACA 
AAGCCCCGCA 
TTCTGGTTAT 
AAGGTAGGTT 
GTTTGTATGG 
GGGACTTACA 
ATTTTGAGTA 
ACTCCTCATT 
TTACG CTGAT 
CGCTGAACTC 
TAAGATTCAC 
GGAGACTGGT 
AACACCCATT 
TCTTTCTCCC 
TATTCGCTTT 
CTGGCTCTGT 
CTCTCGGGTT 
CACTACCAAG 
ACTGGAGACA 
TCGGCCTCCC 
CTTAAATATC 
TCCGGGAAAA 
TTAAAAAAGG 
AGGCTCGTCA 
TTACATGCAA 
GGGAGAATTC 
ATGTAGTTTC 
TGGTCGCGGC 
AGGTTTGCAG 
GCGCGTCGGG 
GATCCTGGAG 
CCACCTGCAA 
CATCGCGCAG 
GAGCCATCAT 
GTCAGCAGAC 
TTAAATGAGC 
AATGGCTTTG 
CTCAGGAATT 
GATTCGCGTT 
ATGAGTGCAT 
TTGAATATGT 
ATTACCACTG 
GTTTTGCTTT 
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GTTTTGGGGT 
CTGTAACCTC 
ACAGGCGAGA 
CGACACCCAG 
CAATATTCTC 
CTTTGTGTGG 
AGTGGATATA 
GGAGCCTCTC 
ATTTTTGAAT 
CTTTCCTTTC 
GTACTAAACA 
CCAAGGCTGC 
GCCCGGGCAC 
TTCGGAAGCT 
TTCGCTTCCA 
GGCTCTTTGA 
AAGACATCCA 
TCATCAGTCT 
GCTGTGATAA 
TTAGACTATG 
GGTTATTTCT 
TCAGGCCTCT 
AAATCTGGTA 
AGAAAATTGT 
GCTCAATACA 
ATATATATAT 
GGTGCAAAAT 
CTTTAAATCT 
ATGTGTGGCT 
ATGCACCATG 
TAGAATTTAC 
TGGGGATTGA 
AATATTTACT 
TTTTACAGAC 
AGACCTTGCC 
GCCTACATCT 
ACAACAAATC 
GTGTTCCAGC 
TTATGAAAAT 
GACTCTAACT 
ATGGACTTGG 
TATTTTTTTT 
TTCTTTATCT 
AATAAAGTTA 
ATTTTCCACT 
TAACTCTTCA 
AAAGAAAATT 
AAGAATCTAT 
TGTTGTTTGC 
CTAAAGGCAC 
AACAGGAGAT 
TTCTTCAGCC 
CGAGAGACTT 
TCATATGGTT 



GGAGTCTCCC 
GAACTCGGGC 
GCCGCCACGC 
AAAAATACAA 
TGATTTCTTT 
TTGTAAATTT 
GCAGCTAAGG 
TTAATCTGCA 
TTTCTTGGGT 
CTCCACAGAC 
GACAGCTCGG 
TCGCAAGAGC 
TGTGGCTCTG 
GCCGTTCCAG 
GAGCTCTGCG 
GGACACAAAC 
GCTCGCTCGC 
TAAAACCCAA 
TTTTTTGTTG 
GTCTTAAAGT 
GACCTTATTA 
AGCTTGCTAT 
AGTAGTTAAC 
GTCTTGCGAG 
TAGTCCCCTA 
ATATACTGTT 
GTGAGGCAGG 
GTCAGTCTGT 
TTGCTTGTAA 
ACATGCCACA 
AAGTTTTAAC 
GTACTGGAAG 
TTAAAATTTT 
TAACTTTAGA 
TCACATTCTT 
AGAATGTAAA 
ACACACACAA 
TTTTAATAAG 
GAATATGTCA 
GGCATAGACA 
TCTATGCCAA 
CCAGTTATAG 
GTAGGAAACA 
CATTACTGTC 
CCCTCACTTA 
TTGACAGTTG 
TATTGAGCAT 
TGTTTTGTAT 
AGAATATACC 
TTCAAAAACA 
AAAAGTTCCA 
CAGAGGCATA 
CTATACACAA 
TACTTTCCCA 



TCTCGCCCAA 
TCAAGCGATC 
C CGG CTAAG A 
TTTTAAATAA 
TTTATATTTT 
TAAGACTT C A 
GGTTAACAAA 
ACCAGGCACA 
CCAATAGTTG 
GTCTCTGCAG 
AAATCCACCG 
GCGCCGGCTA 
CGCGAGATCC 
CGCCTGGTGC 
GTGATGGCGC 
CTTTGCGCCA 
CGCATTCGCG 
AGGCTCTTTT 
TCTTAACAGA 
TGATTAACAG 
AGGTGCTATT 
GATTAGCATT 
TGGCGCTTAC 
TTCCAGTGTC 
GGTTTTCTCA 
AAATTCATTT 
GATCTAACTG 
CGACCAAGCA 
ATAGTCTATC 

^l**!. 1 *.!* *r 
CATTTTCTTT 
AAAATTTAGA 
TATATTTTGT 
AC AAC C AC AG 
TTTTACAATA 
CTGATGTACC 
AAGATCAAAT 
GCAGTTTTTG 
GTTTGTTTTA 
TTTGTTATCC 
GGTGACTACT 
ATGTGCTGGA 
AATGTGTTGG 
TGAGGATCAG 
CATTCTTTGC 
ATATTTAAAA 
TTTGTATTTG 
TAGAGGAGTA 
ATC CAAAAAT 
GCATTCAAGA 
ATGTGAAAAA 
GATGAGATAA 
ACAAACCTTG 
CAATTGCCTC 



GCTGGAGTGC 
CTCTTGACAG 
GCATTTTTCT 
AGCGCATATG 
AACTAGAAAC 
GGAAACTTTT 
ATGACGTCAG 
GAGATGGACC 
GTGGTCTGAC 
GCAAGCTTTT 
GCGGTAAAGC 
CCGGCGGCGT 
GCCGCTACCA 
GAGAAATCGC 
TGCAGGAGGC 
TCCATGCTAA 
GAGAAAGAGC 
CAGAGCCACC 
ACAAATTTCT 
AAATAACGGT 
TGGAGAGAAG 
TGTTTAAACA 
TAGGCATTTT 
TTCCTCAAAA 
TATATTATAT 
GGCTGTTAAC 
GCTCTCATTT 
TAATTTAATC 
TGGTTGCATT 
CAGTACTTCT 
CTGTTGATCT 
GGGATGGGAA 
ATTTTTTTAT 
AATGTCCAAC 
AATATTTTTT 
ATACTAAAAT 
TTGAATTGCA 
GTTTATAAAG 
TGATTCGTTT 
ACAGACAGTA 
CACAAGCTCT 
TCTGATGTAT 
AGGTACTGGG 
ATGGACAGGG 
CCCCTCCTCA 
ATTAACGAAT 
TGAGTAGTGC 
ATTAAGGAGA 
AGACCACTGT 
AGGGAATTCT 
TGCTCTGCTT 
TTCTGCACAA 
TTAAAATAAT 
TCTTTAACTT 



AGCGGCGTGA 
CCTTCTGAGT 
AATTGCCCAC 
CAAATTTCCC 
AATTGGAGGT 
CCAGTACAAG 
AGTAGCTACG 
AATCCAAGAA 
TCTATAAAAG 
CTGTGGTTTT 
GCCACGCAAG 
GAAAAAGCCT 
AAAGTCGACC 
CCAAGACTTC 
TTGTGAGGCC 
GCGAGTGACT 
GTAAATGTAA 
CACTTATTCC 
AAGGACCCCC 
TTGGTCAGTC 
CTGTGTAAGT 
ACTTTGTAAG 
TGCAAAGCTT 
TGCTTAGGAA 
ATATATATAT 
ATTAACCTGA 
TATCCATAGC 
CCTTATATAT 
GCTTTGTCTC 
TGCCTGTAGT 
TGCTTTTCGG 
TACTGTACGC 
CATATAGCTT 
ATTAAAACTA 
ACACCTAACA 
CGCCTGACCA 
TCGTTTACTT 
TAATATTTGC 
TTCTTGACTC 
TAGATATGTT 
GGGCCCAGCT 
AGCGCTTGAC 
TCTGACGAAT 
GGTGGTAGCT 
ACAGAACAAG 
GGATGAAATT 
AAACATTTTA 
GATTGGAGAC 
GGGATCAGGA 
TCTAAACTTT 
GTACCAGGTG 
ACACAGCAGG 
CATATATTCC 
AATGTGAAAG 



TAACAGCTCA 
AGCTGGGATT 
ACTTCTTATG 
TAATCGTCTC 
TTCCGCGTTG 
ACTTGTCCAC 
GTAATGGGCA 
GGGCGCGGGG 
AAGAGTAGCT 
GCCATGGCTC 
CAGCTGGCTA 
CACCGTTACC 
GAGTTGCTGA 
AAGACCGATC 
TACTTGGTAG 
ATTATGCCCA 
AGTTACTTTT 
AACGAAAGTA 
CCGGAAAGCA 
TTGCAGTGTA 
CCACTATCAT 
AGTAAGGGAA 
TGAAAAGATT 
GATTTTCTCA 
ATATATATAT 
AATTTATTCT 
TAGCTACCCA 
GAATTTTTAT 
CTCTAGGACT 
TATTAAAATC 
TTTTGGAGGT 
AAACAAAAGT 
TTACATCACA 
CTAATTCCAA 
TTCTTTCTTG 
ACTGTCAACA 
AAATTCATTT 
ATTTTAAAAA 
TTATACAAGC 
AGAGATGCCA 
GAAGGTCAAG 
TTTTTATATT 
AGCATAAAAG 
CAGTCCAGCT 
GATTCTGCTG 
CTCATTTGTG 
ATATTATATT 
AAAAAGGGGG 
TTCTTTTGAG 
TCTTTCTGAA 
AAAAGACATA 
GAGTCATAGC 
TTTAATCTCC 
CATTTAGCTT 
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TTGCCATTTC 
TTAAATACAT 
TCCAGCTGGA 
GTGTGCATTA 
TAAG C CATC A 
GAAACACAAA 
AATTCCTTCT 
CACCCCTGCC 
CTTATAAAAC 
CACCCAGGTA 
ACGGACGCGC 
TATGCTGCAG 
TTAGATTTGG 
TTCAGACTTA 
TGTTTTCAGG 
AGACTTGGCT 
AAGGAATATT 
ACACCTTGTA 
TAGTTGCTGC 
TATCCCAACT 
AATATTTATT 
TCCCCTGAGT 
CCCTCTTCTG 
AGGGCTCTGG 
TATG CATTTA 
CCAAAAAAAG 
CGCTAAAGCA 
GCTAGTGCAA 
TTGCTTTTTT 
AAGAAACAGG 
TTTAAGTACC 
CACTCCGGCA 
GTATAAATTC 
AAAGTGAAAC 
GCAAAAAGTG 
TCCGATGTTC 
GTCAGCTGGT 
GCTAAGCGTC 
CGGCGCCTTG 
CGTGGCGTTC 
CACGCCAAGC 
CGCACTCTGT 
TAGGGCCGCC 
TCAATGGCTT 
AGACCAGGTT 
TCCGAGATAA 
GTGCAGCGCG 
GTTATTTGTA 
GGTGTCAGGT 
GCTTCCCTGG 
ACTTCCATTT 
AGTAGGACTG 
AAGCAGGCTG 
TAAATAAGGA 



TTTGGGGCTT 
TTGTATGCTT 
GACCCTAACA 
AATGTTGTTT 
TATCCCCTGT 
AGAAGTGAAA 
CCTGGCTCAT 
AGC CAGAGAA 
GGACCCACCC 
GAATAAACAG 
CTGAAACAGT 
GCTGAAGTTT 
GTGAGATGGC 
AAAATTTAGC 
AAATTTAAAA 
ATAGGAAAGC 
TTCGGACAAT 
CGTTATTACA 
TAGCAATATA 
CTGAGCAGAT 
TCTTTTCCAC 
ATCATCATAA 
AATGAGACAG 
CTGGACCAGC 
TTTAACACAG 
CAGTTAGCTA 
AAGGGATTTA 
GGTTTGTACA 
TAAACTACAG 
TCAGCTTTCA 
TTTGAATAGT 
TATTAGTGAA 
AGATTCAAAT 
TATTGTGTTC 
AGGCAGCAAC 
GTGTATAAAT 
TAGACATGTC 
ACCGAAAAGT 
CTAGGCGTGG 
TCAAGGTGTT 
GCAAGACTGT 
ACGG CTTCGG 
CACTCCCTCT 
TACTCGG CTA 
GTTTGGTCTG 
GTG AG CTAT A 
AGTCCGATAA 
CGAGAGCCTA 
CTGTTTTCCC 
TAACACCTGC 
CTAGTTG AG C 
GCGGTTAAAG 
CCTACGCAGT 
CTCAAATTTC 



CACTTTTTTA 
TCATTCTGCT 
GAGTAGAGGT 
GTTCCCAGTT 
GAACTGCACG 
ATGCCCTGTT 
CCTGACTCAA 
CAACCCCCTT 
CATCTCCCTT 
CCTTGTTGCT 
TTAACAGGGT 
ACAGCTAATG 
TAAGATTCTC 
AGTAGCCCTT 
CAACAGTGAG 
ACGAATGCTG 
TTTAACATGT 
GGCTGTGATT 
GTGTTGGGAG 
TTTTTTAAGT 
AGTCTCTTCT 
CCCGATTTAC 
TACAGTGTGA 
AAATTAAATT 
GTTTTACGTG 
CTTATATAAT 
GGC TAGAATA 
GAATTCTCTT 
TGAGAACATC 
AGAAAACATA 
CAATATGTCT 
AGCCCTTAAT 
AGTAGTGTCG 
TCCCTCGCAG 
GCCTCCTTAT 
GCTCGTGGCT 
TGGTCGCGGC 
GCTGCGGGAT 
TGGGGTTAAG 
TCTGGAGAAC 
CACTGCCATG 
CGGTTAATCT 
C AG AAAG AG C 
TTCTGCCTAG 
AGTGGCTGCT 
AACTTCAATG 
ATGAGTAGCT 
AGATGCTAGC 
AGGCGTATCT 
CGTCCTTAAC 
TTGGCGTCCT 
CTGCTTTGCT 
TCGTAAATTC 
TTCTGACTCC 



TGAGGGTTCT 
AATCTGTTTT 
AAAATTTTGC 
GTTCAGTTTG 
TATGCCTCTA 
CCTGCCTTAA 
AAGCTCCCCC 
TGACTGTAAT 
CGCTGACTCT 
CACACAAACC 
TTTTCCTGCC 
CTGTTGAAGT 
AGAGAAAGAA 
GCAGTTTTTC 
AAGCGTGTAT 
CTATTGTATT 
CACATATGAA 
TTAAAAAAAC 
TAAAAACACG 
AGTAACATCT 
CATGCCTCGT 
AGATGAAGGC 
AGGATAGCAA 
AATGTAAAAT 
ACACAGGTGC 
GAATTGGACA 
TATAACTGTG 
GGCCTCAGCC 
TTTCATATGA 
AGGCCAGAGT 
TCAAGCACTT 
ATAAGCCCTT 
TAAACGGGAG 
TCCTTAGGTC 
CCTCGCTCCC 
TGCTTTCTTT 
AAAGGCGGTA 
AACATCCAAG 
CGAATTTCCG 
GTGATCCGGG 
GATGTGGTTT 
TTTCGTCAGT 
TGTGATTGTA 
TATGTAGAAC 
AAAGCAGAAA 
C T ATAGTTTT 
CAGCTTTTTA 
TGCCTGGAAC 
GACTTAACGT 
CGCCCCCTGC 
GCTGAGTGAC 
ATTTTCAGTC 
CCACTTAGTA 
GAGGTCCGTG 



CCTGTCCCAT 
ATGGCAAATG 
CTCCCTACAA 
TCAGGCCTCT 
GATGGCCTGA 
CTGATGACAT 
ACTGAGCACC 
TTTC C ACT AT 
TTTCGGACTC 
CTGTTTGATG 
CAGTCACAAC 
CTAAAATCAG 
GTCAAGTTTG 
CAATAGAAGT 
GGAGAGTTGA 
GCACCTTGGA 
AAGCTAAACG 
AATCCTTACT 
AAAATGAGAG 
AAAATTAAAC 
TCACATTAGC 
ACGGTTGCAA 
AACTCCACTC 
GGATTAACAG 
TCTCATAAGG 
ATTAGTAAAA 
TAGAGAAGCG 
TCCTATCCTT 
GAATTTCACC 
GATCTTTTCA 
G AAAG AC TT A 
ATTAAAATTC 
GGAAAAACTA 
ACTGCCCCTC 
GCTTTCAGTT 
TCGCGTACCT 
AAGGTTTGGG 
GCATCACCAA 
GTTTGATTTA 
ACGCCGTGAC 
ACGCGCTCAA 
TTTCTTCCAA 
TTCTTTCGGA 
TATTATAAAC 
TCAGCTAAGT 
GACATGTCAA 
GTTTTAAAAA 
TGAGTAGGTG 
CAGCAAAAGC 
CGGTAGCGCC 
GTCACCTCCC 
CTCAGGCTGG 
GACTAAGGGA 
GCAGCAGCTA 



AAAATTTACA 
AATTATCAGG 
GATAGAGATT 
GAGCCGAAGC 
AGTAACTGAA 
TACCTTGTGA 
TTGTGACCCC 
CTACCCAAAT 
AGCCCGCCTG 
GTCTCTTCAC 
AAAGTGATGT 
TTTTGGTTTG 
GGGTGCATTT 
GATTTACGAA 
ACTACACTCC 
AAAGAGAACA 
GAATCTGTCA 
AATACATACA 
TTCAGGACAA 
CATATTATGT 
TAATTAAAAG 
TGAGCTATCA 
CCATCCTCTT 
GAGAAAGGTA 
TAATGAAAGC 
TGTAAAAATG 
CCCAGCAAGG 
GAGAAGAATG 
TACTGCTTCT 
CGCCTGCTCT 
AAAAGTTTAC 
TCAGTCGAGG 
AAGGGATTAA 
GAGGGGCGGA 
CTCAATAAGG 
GGTTTTTGTT 
TAAGGGAGGT 
ACCGGCCATT 
TGAGGAGACT 
CTACACGGAG 
GCGTCAAGGA 
TGGCCCTTTT 
TGGTAACATC 
CAGTTGGGAG 
AAACGAGGTC 
GCAACTTAAC 
CGAGTTGTGC 
GATTAAAATG 
TGTACTTTTA 
AGAAGCCTTT 
CCTTCTGTGG 
AGGCTCCCCT 
GTCTGTTTTA 
TAAGATGGAA 
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1134 01 GCCCCCTCTG ATGTAAGATT CTCAGATGAC TTGCATCTTC ACTGTACCTG TCAACCCAAT 

113461 AGTCTTCTAT TCCTGCCTTA AATTGTAAAT TCCAAAACTG ATTTAATTGT GAAAGTTTCA 

113 521 AACTGTACGA CCTAGGAAGT GTCAAAGTTA GGTGACCAGA TTTTTAGAAG TCAGCCAAAT 

113 581 ATTCAGCATC TTTGATTTAG TAACAAATAT ATTGATGGCT ACTTCAGCAA AAAAAATCAA 

113641 CTTTGTTTTC TGGTTACTTT GCTAACAAGC TTCTCCTGAC AGGAGGATAT AGTGAATAGG 

113 701 CAGTTGAATA AGTGAGTTCG GGTGAGAGGT CTGAGCTGGA GATAAAAATG TGTGAGTCAT 

113761 CAGCAGATAA ATAAATGCTG AGACCAGATG AGATGG CTAA AAACTGAAAC ATAATGTAGT 

113 821 GCAGCATTGT TTGTAATAGT AAATGAGTGG CAACTGTAAA GTTTTCATCA GAAAGGACTA 

113 881 GAGTGATCTA TACATCCATA AAATAGAGTA TTTCTCTACA CAGCCCTACT AAAGAATGAG 

113 941 AAAGCTGTAC TCCACTACAT ACTCTGGTGT ACTCTGGCTC AGTTCTTGGA CTCCTCTTTT 

114 001 CTTGGCTAAC TCAACTGGCC TCACCACTTA CATGCTCTGT GCTCTGTCAA ATAGTTTGTT 
114 061 CAACAGAACA CCACGGCCTA GCTGTAAGTG CCACGTTAAC TTCTAGCAAT GCCAAAGCCT 
114121 GTGATAGTGG CAGCTTCGGG CTGTTTCTCA TTCCCGGGAT. GCCTAACCAC CTCTCCAAAT 
114181 TCTATCAGTT TGCTTCCACC CACTTCAAGC TTCAGAACGA AAC AT AG AG C TTAAGAAATA 
114241 TAGGCCCGGC AAGGTGGCTC ACGCCTGTAA TCCCGGCACT TTGGAAAGCT GAGCCTGGTG 
114 3 01 GATCACCTGG GGTCAGGGGT TCGAGACCAG CCTGGCCAAT ATTGTGAAAC CCCGTCTCTA 
114361 CTAAAAAAAA AAAAAAATTA GCTGGGCATG GTTGCGGGCG ACTGTAATCC AAGCTACTCG 
114 421 GGAGGGTGAG ACAGGAGAAT AGCTTGAACT CGGGAGGCAG AAGTTGCAGT GAGTTGAGAT 
1144 81 CGCGCTATTA CACTTAGGCC TGGGAGACAA GAGTGAAACT GTGTCTCTAA ATAAGTGTTT 
114 541 GCAATTATAA ACCATCTCCC TGACCTTAAA TCTCTAGACT CATATACAAC TGCATATTTG 
114601 ATGTATCTAA TTGAATAATG GGCATCTCGA ACTTGTCCAA AATATGTTTA TACGTAAACA 
114 661 CCAAGTCTGT TCTTCCTCTG ATATTTGTCA TGTCAATCAA TAGAACTCCA TTCTTCAAGC 
114 721 AGCTTGGGCC AGGAATTGTG CAATATTGTT TGTCCTGAGC TTCTTACAAC TTTCACCCAA 
114781 TGCAGTCAGC TCTGTTGAAA ATCAATCAGA ATACCTTTCA TTGTTTTCTT TGCTGCTTCT 
114841 CTAGGAGCAA GCTGCCATGG CGGTTTGTCT GAATGACCAC AGTG AC CCC A AACTGGTCTT 
114 901 TGTTTTCACT TTTAATCCCC CTGTCATACA GTTTTTCTCT ATC C AG CATC AACAGTGATC 

114 961 CTTTTTGAAG GTATTATGTC CACTGTCTGC TGAAAAGATT CCACTGGCTT TCCATCACCT 
115021 T C AT AAT AAA AACCAGCATC CTTATCATAG CCTACAAGTA AGATGACCAA CCATTACAGT 
115081 TTGCCTGACT CTCAGGGGTT TCTCAGGGTG TAAGACTTAC AGTGCTGAAA CTTAGAAAGT 
115141 TCCAAGCAAA CTAGGATGAG CTGCTCAACC TACTAGATCT GTACTCTGGC TACCCTCTGA 
115201 CCTCATTCTC TTCGCAGTTC TTTCTCTTCA CTGACCTTGC TGTTTCTGGA ATGGACCAAG 
115261 CATTTCCAGC ATCAGCACCT TTATATCTAT TCTTTCTCCC TAGAAGGGTC TTGTCCTGGA 
115321 TATCTGAATG GCTCTAGATC TCATTTCATT CAAGCCTCTC CTCAAATACC AACCTTAAGA 
115381 AAGAGACCTC CCATAATCAT CCCTTGTAAA ATAAGCTTTT CTGCTCATTT AGCATATATA 
115441 TATATAGTTG ACTATCCTCA ATAGCATATA TATATAACAT TTCCCCACCT AGAATTATAT 
115501 ATGTAATAAT ATATTTAACA AAAAATACAT ATAACTAGAT ATATTTTATT TTGTGTTTGT 
115561 TCTCTCTCCC CCAACTGGAA TATATTTTTT GAAGGTAGGG ACTTTGTTTT GTCCCAGAAG 
115621 TATCCCTAGC ACCTTGAACA GGG CTGACGT TTAACAGGTA GTTTATGGAG GTTTGTTGAA 
115681 TGAAAGGATG TGTGAATTTT CTATGTAAGT CTCCAGGCTC TCCACTAAGC C C AC CAG AAT 
115741 GCTAACACAA TCAATTCCCC ATCTCATTCC TTGACCTGCC ACTGCCTGAA GCAATCAGCG 
1158 01 TGCAGTTTCT CTTTAGAAAA TCTGGGGGAT AGTCTAGGGG TTGCAAATTA AGCAACATTA 
115861 TCTTTGTTCT GAACAAGGAC TGCATGAGTG TTAGGACTGA AGAAGGCCCA AGGTGGTGGT 
115921 GGGTATGCCT AAGATGAGTA TGACATATCA GCAATGCTAT GAACATAGCA ATGCTATGAA 

115 981 AGGCCAGGCA AAACGTAACA GGAGCTAGTC GTGGCTTATT GTTACAACGA CTATACCTCC 
116041 CATATGGGTA ATCGATATCC ACACACCCCT CTACATTGAC TCTGGAATTC AGGAAAGGGA 
116101 ATTAAAATTT TCTAACTTAT GTACCCCAAT GATTTCAACA ATATCTGGCA TATGAGATCA 
116161 ATAAATATCT TTAAAATACC AACTAAGAAA GACATAAAAT GACCCACCCT CCATACCAGG 
116221 CTCATTTTTG CTCCTCTGAT TCCTGAAACT ATCCAGAATG CAGCTATGAA TTCTCTCCAT 
1162 81 TGTCAGTTTT AAATTAAGCC AAGCTGGGTA CTTGTGTAAT TCCTCAAGAA ATCCTGGATG 
116341 AAAACTGTCA GGTGGAAAAC AGGACCTCAA AATAAAGAGA CATCCATCAC TGAAGCTAAC 
1164 01 ATCGTGAGGC TGAAATCAGT CCTATAACAA TGGTACCAAA AAGAGCACAA TGAGAGGCAT 
116461 TTGTGAATAT TTACTCAGAT GAGAGTAAGA TATTTCCCTA TCAGCTAACC TGAAGTTCAC 
116521 ATCCCTTTTC CAGCTGAGTT CTGAAGCTAG ATGTACTTAA CTGGAACACA TAACTGCATC 
116581 AGGAACATCC TTTAAAACTA TGGCTACAAT GGCTTGACTG GACAAACCCC AGGCTTCCAG 
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116641 GTTTAGCACA GGTGGCCCTT CACAGACCAA CATTGCCTAT GCTACCAACC TCATGTCCTA 

116701 CCACCCTGCT TGCATCATTT CTCTCTCTGC ATATATAAAA ATATATGTGT ATGTATATAA 

116761 TCAGCTTTAT TGATATTTAA TATACCACAA AATTTGCCCA CTTTAGGTAC AGTTCAATGA 

116821 ATTTTACCGT GTTTTCTTAG TTGTACAACC ATCATCACAA TTTAATTTCG GAATATTTCT 

1168 81 ATCACCCAAA TTTCCATTTC TGCGTAAAGG GGGAAAAAAA AAGGTTAACT GCTGAAGGCC 

116 941 GCGGTAACAC TGAAAAAGGT GCCTTTTCTC TCTAAAACAG ATTTTAATCT CCCCTGAATT 

117 001 TAGTGTCCTG GGTATTCCAG GAGTCTGAAT AGGGTTTCAA TTTTCAGGGT CTTTTTAATA 
117061 GAGTAAAACT GTATTGGTGG CGATAAATTT AGTATTGCTC TCAGTACATG ATTGAGGGAT 
117121 ACTTAAATGT CTCTGTGATT TTATTTCATA ATCGCTAAAA GATGGTTTTT TTTTTTCCTA 
117181 AAACAGGGTT TTTGTTTTTT CTCAATAAGC TTCTTAGCTT CCCCTCCGGC TCCCTGGCTT 

1172 41 GCCTCAGGAA ATATTAG CTC ATCAGTTCTG ATTGGTTGAC AGCTACGAAT GGCCCTCATT 
117301 GATTGGGCAG CGCTTCTTTG TCCCTTGGAA ACTAATACAA ATTTTTAACA CTACTTTTTT 

1173 61 TCCACTCTTT CTTCAGAGTT GGAATATCGT TGCTCCCCTA CCCATATGTA GTGAGTGGAG 
117421 GGCAAACTTG GAGTTCCCCT AATCTTTCCT TTTTAGGATG TCAGCTCAGT ATCATTCATC 
117481 TTAATTACAC ATTGAGCTTC TTGACTTAAT GGATACAGCT CTTCTTTTGT TTAGTTGGGC 
117 541 GGCCCTGAAA AGGGCCTTTG GTTCAGAAAT GCAAGCTGTG GAGAAATCAG CAACCTTAAC 
1176 01 CGCCAAAGCC ATAAAGGGTG CGTCCCTGGC GCTTAAGCGC GTAGACCACG TCCATGGCAG 
117 661 TGACTGTCTT GCGCTTGGCG TGCTCCGTAT AGGTGACAGC GTCACGGATC ACGTTCTCCA 
117721 AAAACACCTT GAGCACCCCG CGAGTCTCCT CGTAGATCAG ACCAGAGATC CGCTTCACAC 
117781 CGCCACGCCG GGCCAGACGC CGGATGGCCG GCTTGGTGAT GCCCTGGATG TTGTCACGCA 
117841 ACACCTTGCG GTGGCGCTTG GCACCCCCCT TACCCAAACC CTTCCCGCCC TTACCACGTC 
117 901 CAGACATGAC TTCCCAAGAA GTGAACCAAG AG C AAGTGAG AGAATAGGAA ACCGATCTTT 

117 961 ATATATCTAC GTTACCCCTG CCCCCACCTC CAGCGGACAC AGAGACTGAA AAGCGCGCAG 
118021 GCGGGAAATG TGACGCCTAC AGTCCGCTCC TTTAACCCCT CCTCCAAGCC CCAGGAAATG 

118 081 GCGGGAGCAG CGATTGGGGG AGGGTGGGGA GATGAGGGTG GGACCAAGCA GGCTTGACCA 
118141 ATGGCCTTTA TTTTCTTAAC AG AG CTAC AG GCTTTGAGGA ACTGGGTTAA GAATTAAATG 
118201 TAAACCCATT CTGACTCCAG AATTATTTTA AGTCGAACTT TTTTTTTAAC CGAATCTCTC 
118 261 TGTCGCCCAG ACTGGAGTAC ATTAGAGCCA TCTCGATTCA CTGAAACCTC TGCCTCTCAG 
1183 21 GTTCAAGTGT TTCTCCTGCC TCAGCCTTCA GAGTGTACCT GGGATTACAA GCGCTCGCCG 
1183 81 TCGCGCCCGG CGTGTTTTTG TATTTTTCGT AGAGACGGGA TTCGGCCATG TTGGCCAGGC 
118441 TGATCCCGAA CTCCTGATTT CTGGTAATCC GCCCGCCTCA GCCTCTTAAA GTGCTTGAAT 
118501 TACAGGCGTG AGTCACCGCG ACCGGCCGAA ATCGATTGGT TTTGAAGCCT TCAGTAG CAT 
118 561 TAAAACGAAA AGTGCTCCCA ATGCATTCCC TTTTGTCTTA AATTGGTTTC TTAC AG CTAC 
118621 TTTACTTGAA AAGGTGGTGG CTCTGAAAAG AGCCTTTGCT TGGACCGTCA GAGAGACCAC 
1186 81 AGTAATCACG CCCTCTCTCC GCGGATGCGG CGGGCGAGCT GGATGTCCTT GGGCATGATA 
118741 GTGACGCGCT TGGCGTGGAT GGCGCACAGG TTAGTGTCCT CAAATAGCCC TACCAAGTAG 
118801 GCCTCGCACG CCTCCTGCAG AGCCATCACA GCGGAGCTCT GGAAACGCAG GTCTGTTTTA 
118861 AAGTCCTGCG CAATCTCGCG CACCAGGCGC TGGAAAGGTA GTTTACGAAT AAGCAGTTCA 
118921 GTGGACTTCT GATAACGGCG GATCTCGCGC AGAGCCACGG TGCCCGGCCG GTAGCGGTGG 
118 981 GGCTTTTTCA CGCCGCCGGT GGCCGGAGCG CTTTTGCGGG CTGCCTTAGT GGCCAACTGT 
119041 TTGCGTGGCG CCTTGCCACC AGTAGACTTC CGAGCAGTTT GCTTAGTGCG AGCCATGACG 
119101 GAAAAACAGC ACAGCGGAAC ACCCAACACT AGCGCAAATA CGCCCATGAG CTGCTCTATT 
119161 TATAGTGTGT AAAGTGCAGT GATTGGATGA TAGAAGACGC TAAATATGAC GTTACACACT 
119221 CTGATTGGTC TATCTTTAAG CCAG CAACAA TCGTGCAGTT TCACCGGCTA CTATATTCTA 
119281 TTCCAACTCT ACAGATGATT ATTTAAGTGG TATTTTATTA CTACTATTAT TTTATTTTAC 
119341 TTTTGCTTTG TTCCCCAAGC TGGTCTTAAA CTTGGGCTCA AAAGATCTTC CCGCCTCAGC 

1194 01 ATCCAGAGTA GCTGGGATTA CAGGGGAGCC CCACTGCGCC GGCTTGGACT TTAATTTTTT 
119461 AAACTTGTCC TCTTCTACAT CTGGTTTTCA TAACCTGAAG GCTGTGTTTA TTTTCCATAA 
119521 AACAAGGCAT TG ATT C C AAA GGTATTATAA TTCCCCAATT CCGTATAACC TTCAGCTCTT 

1195 81 TAGGAAAAAA AAAAAAAAAA AAAAAAGAGG GAATACTGCT CACCTCCTCT CCGGAAATGT 
119641 ACCCTTTACG GGAATTTCTG AAACCTTTCA CAAGAATTGG ATTCCTTTGT AATGCTTTAA 
119701 TTGACTTAGG AGTGTTATTG AAATCTACAA AGCATCTCAA ACATAGTAGG ATTACACTAT 
119761 T ACT CAGAAA CATTTTCTAT GAGACGTCTT TCTCTTGATT ATGCTCTTTG AATCCTAAAC 
119821 TTGCAGCGTT CTGCAGCTTT TGTTTTCTAA AGCCTAGGTG TACTCTGCCA GTCACAAAAT 
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119861 GGCGTTTCTC CAGCACTGCC GCCAGGTACC ACCAGCTGGG AGTTGTTCCT CTTGCGGAGC 

119941 AGGAGGTGGA CTTGGCCCAA GAGAAACTGG ATAGTGGTTC GCAAGGAACA TAATTTAGCA 

120001 TTGCCAAGAG CTAATGCAAT CATTTTGAAA ATCTCAAAAC ACTGAAAAGT GGATTGTGAC 

120061 CTTTTTAAAT T C AC AAG AG A CAGGCCACAT TCTATCTTTT GATTGGTTTA GGCTATTTTC 

12 0121 TTGAAC AG CC ATTTAGAAAG CAGATCTATC ATCCTTCATT TGCATGGAGC GTTCCCATTT 

120181 TATTTGAAAC CAGTTTAACC CAATAGAAAA AAGGGAGGCA GAACCCATTA TTTAAAGTGG 

12 0241 AAACTCCTGA ATCAGATAAT TAGGAGTATT TCCTTTTCAA AAGTTGCGTT TTTTCAGATA 

12 0301 CCTCGCTTAT TACACTAAGA AAGGTTTATA TCTTTCACAA AGGGTTTACT TACAAAAATC 

12 0361 TTCCAATTTT GTATACCTGT GTTTCATAAC TGACTAGCCG TCAAACCAAG ATGTAGAGTT 

12 0421 TCCAACCGTT ATTTTCCAAA TTTTTAGAAA TTACGTGAAA TATTTGAATG CATGCCTTCT 

1204 81 CAATAAAATG GGACGTAGGA AGCACTGGTG CAGAAGATGG GTACAATACT TATCTGGGAC 

120541 CACTCCATTA TTTGGTTGGC ACGTTGTTTG AAGAAAAAGG GGAAAAGCTC AGGTTACTTA 

12 0601 GCATGGTTCG GACTTATTTG AAAACTACCA CAGCAGGAGC GGAAATAAGA CCGCATTACC 

12 0661 TCACTCTCTG CTGTGCTGTG CTAGGGGGTT ATCCAGAATA GGATTGTAGA AGTGGATGTC 

12 0721 GATTTAATAG TTTTTTATTC TCCCATTAGC TGAGTCTCTG ATTGGCAATG TGAGATCGTT 

12 0781 TTAGCTTATT GATACTTTGA AATGCACTTA ACAGCCACAA ACAAGTTAAA GGGTTGTTAC 

12 0841 CATAAAATCT TATCCCCAGG GTGTGCTTGC ATTTATCACC CGTGTTTGCT TTCACACTAA 

12 0 901 GTGGACTTAA CTCCCCAGCA GAATGCCTGT CAGGGAACCG GTTTCGTGGA CCCAGCATTT 

12 0 961 AACGCCTTTC GCAGGCTTGT GAGGCCCATA AATATTTGTT GAATAAAAGA ATGAGTTGAC 

121021 CATGTCATGG TGCGCTGATT GCGTGTGCTG ACATGGAACA CAGGTTGTAA AC CTTAAT AC 

121081 CAATTTGGGG CATGTTGTAT GGATGAAAAG GGCATTGGAA ATTCCTGAAG TGCATCCCAC 

121141 ATTGGACTGT GGAAATAAGT TGCAAGTGCA GAAACGTTTC CACACTTGCA GTTTGAGTAT 

1212 01 TAATTGC AG C GTTTGTGAAT TCTGGTGTTG TCTACGATTC ATTCTTGTTT GACGTGAAAG 

121261 GTATTCGCGA GACACATCGC TCTAAAACAT TGCCAGAAAA TGTAATAGAG TTGATGACAA 

121321 CTGGCCCTAA CACGGCCTAA AACTCGCACT TTTCTCTCCC TCCGCAACTA TTCAAAACAC 

121381 TGTATTTTAC ATTTCTTGCA AATTAAAAAC TAACATCTCT GGCAACGGAC CTCTAAAAAT 

121441 TTCTAATAAA ACTCCTCGGA TGCTTGTGGC ACTGCATTTG TAAACCGCCC CCTCTCAACC 

121501 TACTCCCTAA AAAAGAGCTG CTTTTTGAGA GAGAAGCGGT ACCCTCTGAT GTTACTGGGC 

121561 GGCAGTCTGC CTACAATTTC CTTCACAATG AGGCAACCAG AGCGGCTTTT TCTGTGTGTT 

121621 TGCTTGCGTT GAGGGGAGCA GGACCATAGG CCCTAGAGGC CCCCAGCTGC CTTCTGAGAC 

121681 TGGGCGAAAC CCTCGGCAGC GCGCAGGGGG CGCTAGGGCG CGAGGGGCGG GCACTGACGG 

121741 GCACCAATCA CGGCGCAGTC CCACCCTATA AATAGGCTGC GTTGGGGCCT TTTTTTCGCA 

121801 TCCTGCTTCG TCAGGTTTAT ACCACTTTAT TTGGTGTGCT GTGTTAGTCA CCATGTCTGA 

121861 AACAGTGCCT CCCGCCCCCG CCGCTTCTGC TGCTCCTGAG AAACCTTTAG CTGGCAAGAA 

121921 GG C AAAG AAA CCTGCTAAGG CTGCAGCAGC CTCCAAGAAA AAACCCGCTG GCCCTTCCGT 

121981 GTCAGAGCTG ATCGTGCAGG CTGCTTCCTC CTCTAAGGAG CGTGGTGGTG TGTCGTTGGC 

12 2 041 AGCTCTTAAA AAGGCGCTGG CGGCCGCAGG CTACGACGTG GAGAAGAACA ACAGCCGCAT 

122101 TAAGCTGGGC ATTAAGAGCC TGGTAAGCAA GGGAACGTTG GTGCAGACAA AGGGTACCGG 

122161 AGCCTCGGGT TCCTTCAAGC TCAACAAGAA GGCGTCCTCC GTGGAAACCA AGCCCGGCGC 

12 2221 CTCAAAGGTG GCTACAAAAA CTAAGGCAAC GGGTGCATCT AAAAAGCTCA AAAAGGCCAC 

122281 GGGGGCTAGC AAAAAGAGCG TCAAGACTCC GAAAAAGGCT AAAAAGCCTG CGGCAACAAG 

122 341 GAAATCCTCC AAGAATCCAA AAAAACCCAA AACTGTAAAG CCCAAGAAAG TAGCTAAAAG 

122401 CCCTGCTAAA GCTAAGGCTG TAAAACCCAA GGCGGCCAAG GCTAGGGTGA CGAAGCCAAA 

122461 GACTGCCAAA CCCAAGAAAG CGGCACCCAA GAAAAAGTAA ATTCAGTTAG AAGTTTCTTC 

122521 TAGTAACCCA ACGGCTCTTT TAAGAGCCAC CTACGCATTT CAGGAAAAGA GCTGTAGTAC 

122581 ACAGATGAAA TCCCCCAAGC AAATG C AAC A CGCCCTCAAT TATATTAGAA TCACTTGGAG 

12 2641 AGTCGATAGA ACTTTAACAT AGCCTCATCT AGTAAGAATT TACTACTCAA TCTATCAAAG 

122701 ATAGCAAGGT GAATTCAAAT GCACCGAGTT AAAATCGAGT TTTAAAGTCA CCTGGGTTTC 

122 761 GGTAGCCGGA AGTCCCGCGT CTCACGACTC CAAGCTAATT AGTCATAACC GTATTGAACC 
122821 AAGGTTGAAG CCCAGTCCCA GGCTTGAGGC TTTTTATTAT ACAAGGTTAA AGTGGGGATA 
122881 TTGCGTTTTG GGGTCAATAT TGCTAAAGTA GCATTTTCCG AAATTGGGTG GTCCTAAGAA 
122941 ATGCTTCTGG GATAGTTGGC AAAATATATG GCTTAACCAC GCCCTCTCCA CAGGAGTGGC 

123 001 TAGCGAGCTG TCTGTCCTTG GGAAGGACGG TGACCCTGCT GGCGTGGCTG GCGCCCACGT 
123061 TGGCGTCCTC TGAAAGCCCC GCCAGGTAGG CCTAGCTCGC TTGCTTTCTG CAGCGCCATC 
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12 3121 ATGACAAAGC TTTGAAACGC AAAATGCTTT CTTTGTGCAG CGCCTTACCA TGGGTG C ACT 

123181 TACGGGCTGT CGACTTGGTT TAGGCCCTTG TCAGGACAAA GGAGCTTAGT TTGTTGGAGT 

123241 TTTAG AG CTG CAACCCAAAA TCCCTTGCTC GGTTTCTCTG TTTTTAGAAA CGGAAGCGCC 

123301 CTGATTGGAT ATTTGAAAAT TACTGTGCTT AACTGGATCG TGTTTCATCA ATCGTGCAGG 

123361 ATTTTCAACC CTGGTGGAGC CCACACATTC AAAACTGAAG ATCCTTTTCT CAGAACTGCC 

12 3421 CCTTTAAGCT TTTGCAATTT TAATTCTGGG GGTCAGATTT TAATAATTGG ACTTTTTTGT 

1234 81 TTACATCTGA CAAGAGTATA TGATGAGCCA AGTTTACTCA CTTTTACTTA GTGCAGTTCA 

12 3 541 ATTCTAAAAG TTTATTTTTG CGTGTGTGCA TATGAGTTAA TAATCAGTTG TATTTTTCAA 

12 36 01 ACGGTCTTTT TTCAATTGTT TTGCTTAGCT CCTTCCATCG TCTAAAGTCA GGGATACAGG 

1236 61 CACATCACAT CCCTGTTCCC CCTTCCTCAA ACTAATATGT AGCTACCTAG GTTTATCCTT 

123721 TAAAACAAAA ATTCTCACCT ATTTTTGTGA GAAATATACA TGTTTTTCTT TGAACTAAGT 

123 781 ATTTTACATA CACCTATCTA TATACATGCA TACTTGTGGT TTTGTTTTTT TAAAAAAAAA 

123841 AAAAAAAAAA CACGTTATCT TTTGAGACTG GGTCTCAGTC TGTTGCCCAG ACTGGACTGC 

123 901 AGTGGCATAA TCACAGCACA CTGTAACCTC CAACTCCTGG GCTCAGGCTA TCCTGCAGCC 
123961 TCAGCATCCG GAGTAGCTGG GATTGCATGC ACGCACCACC AAGCCGGGCT TTTTGTTTTT 
124021 ATTTTTTGTG GAGACAGTCA CACCATGTTG TCCAAGCTGG TCTAGAAATG GCCTCAAGTG 
124081 ATCATCGACC TCCCAAAGTG TTGGGATTAC GGTCACTGTG CCTGGCCTTG TATGCATAAT 
124141 TGTTTTGTCT TTTGATTAGG GTTATTAATT TAAAAAACAA AGCCTGGACG CAGTGGCTCA 
124201 CATCTGTAAT CCCAGCACTT TAGGAAGCCG GATGGGCAGA TTACTTGAGC TCAGGAGTTC 
124261 AAGACCAGCC TGGGCAACAT GGTGAAATCC CATCTTGACA AAAAATACAA AAAATTAGCA 

124 321 AGGCCCAGTG GCACGCACTT ATAGTCCCAG CTACTTGGGA GGCTGGGGTG GGAAGATGAC 
12 43 81 TGGAACCTGG GAGGTAGAGG CTGCAGTGAG C AG AG AT CGT GCCACTGCAC TCAAGCCTAG 
12 4441 GTGACAGAAT GAGACCCAGT CTCAAAACAA AAATAATAAA AATTTTTTAC AACGATGTTA 
124501 TATACACTTC TGCATGTTGC TTTTCTCTTA ACCAAACTTT TCTAAAACCC TGTCATGAAA 
12 4 561 AAAGAAATCC TTCACATGGA ATAGCATAAG TTATTCATCC ATTTCTTATT GATAAGCATT 
124621 GATGTTTCCA GTTACCACTG CTGAACATGG TGCAATTGAA TAGAATTCCA GGGCTGAGAT 
124 681 TGCTAGGTTT TAGGTTGTAT TTTATTATTT TATTTATTTA TTTATTTATT TAG AC AG AG T 
124741 CTTACTCTGT CACCCATGGT GGAGTACAGT GCCATGACCT CAGTTGCAAC CTTTGCCTCC 
124 801 TGAGTTCAAG CGATTCTCAT GCCTCCGGTC TCCCGAGTAG CTGGGATTAC AGGCACCTGC 
124 861 CACCAGGCCT GGCTAATTTT TGTATTTTTA GGAGAGATGG GGTTTCACCA TGTTGGCCAG 
124 921 ACTGGTCTCA AACTCCTGGC CTCAAGTGAT CTGGCCACCT CGGCCTCCCG AAGTGCTGGG 
124 981 ATTACAGGTG TGAGCCATGG CTCCAGACCT GGACTTTGTC TTCTGTTTCA TCAGTCCTTC 
12 5041 TGTTGGTTCA AGCACAGTAT CACACTGAAG ACTGATGATT CTATATAAAT ATGGTAAAGA 
12 5101 CTGTACACCC TAACTGTTCT TATTTTTTAA TTTTAAGGCA ATTTTAGATT CCAGCTTTCC 
12 5161 AAAGAATTGT GGAATGCTTA GAGCTAGAGA AGCCTTGGAA GTCATTTAGT TTTTGTTTTG 
12 5221 T C AG AG AAAA TTCTGTAGAG ACTCTGTCCT GCTCTCACTG AATACCATCC CATAGTACCC 
125281 CCCAACAGCT TTAAAGGGCA ATAATACCTT ATGGACAGTA TGCTTTTCCT CAAATATATT 
12 5341 CTAAGCCATG GTCAATGCAA AAGAGTGAGA AGGAAAGTAG AATAAGTTAT CTAAGAATCA 
125401 GTGGGTGCTC TCTTTAAACT GATTTATCAC TCCCCCTTCC AAACTCTCTT GAAGGTCACT 
1254 61 CTGCCTCCCT TTCTACATAA GAACTCCTAA CTCCAAGGGA GGAAGGTAAG TTATTCTTAT 
125521 TCCTTGCTTA GAAAAAGAGA AAATAGGTTT GGTAAGCATC CGCTTTCTGC TACCATTCTC 
125581 TGTGTTTCTG TGTTTTTTAT AGGATCATTC AATTATTGGT TGGCTCTTGA GAGGGAATGC 
125641 AAGGTTCAAG GACACAAGCC TAGATCTTGC CTGTATAGAA CCTCATGATG TTATGCTTCT 
125701 CTAAAATGAG GCCTGGAGGA GACATGTTGA AAGTG AC C C A TAAATCTGCA GTATCTCATG 
12 5761 TCTCTCAATG GGGACAAGGA GTACCATGGG AAATAGCATT AGGTCAATGA CAGTAACAAC 
125821 TCCCAGGTGA GTTGATTTAT TCTTTTATTT ATAAAGTTGT TAATATGCTA CATAGTCCCT 
125881 AATTTTGCCA CAAATAGTCA TTATTTTAAT TTCAT AT TTC ACTATTGATA AATGAAGGAA 
12 5941 AAAATGAGTA GCAGTTAAGC AGTCCATAAA CCTACATATA AAGCAAATTG GAGATTTTAA 
126001 AATTGATTCT GGATGCTTAA AATCCTTCTC ATTGAAAAAA AATTTCGTAT TAGAAGATTT 
126061 CAACATTCTT TAAACTGAGA AGCATAACAT ATAAACAGAA AACCACAGCA AAACAAAAAT 
126121 GCAAAGCTCA ATAAATGAAC ACAAAGTGAA CACCATAATA ATTGCCACAC AAGTAAAAAA 
126181 ACAGAAAATC AGCCAACCCT CCCAGAGCTG CCTGATGCTT GCTTCCAGTC ACATTATCAC 
126241 TCCATCTGCC CTAAACATAA CCCCTATTTT GATTTCCAAT GCTGTAATTT AGTATGCCTG 
126301 TTTTTGAAAC ATATAAAATG GAAATAAAAC AAATGTAATC CTATGTACCT GACATATTTC 
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CAAAGGTTGC 

TTCAGTGGAA 

AAGAGAATAT 

TAGGAAACTA 

TCCCAGTAAT 

ACAGAGCAGT 

AACCATGGAT 

AAGG CTTAAT 

TTAAGAATAT 

TTTTTGGACT 

TGTATTTCAC 



TCTGTGTTGC 

AGTGCAACAA 

AAAC AAT AC C 

TTTGTATACC 

TATTAGGTTT 

ACCTCACAAA 

CCAATATTTT 

TGTGCAGTAT 

GATTGTATAT 

AAAAGGAATT 

CAAGAGATTC 

ATTG TTTGGT 

TTCTTACAAT 

TTCTTACTGT 

TTGTGTCCTC 

TTATAAGGAC 

AATATCTCCA 

TTTCAACATC 

GGTTTAAGAT 

TGCTGTGTCA 

CCTGGGTTCA 

GCCCTGCTAA 

CTCAAACTCC 

GGTGTGAGCC 

AACCCTTTAT 

AGAGGCTGGA 

GCGATCCTCC 

AGCTAATCTT 

TGAGCTCAAG 

ACCACTGCAC 

AAATTAAGCA 

TTCTATTTCA 

AGACTACCCT 

TTAGGTTAGT 

TACAATTTAT 

AACTACTTTT 

AG C AG AT ATG 

TAGTTGAAAA 

C AC AAGG CAG 

TTAAAGTCCC 

AGATATAATT 

GTAGTATATA 

GACACACACA 

CAGCAAGTAC 

GCATAGATCT 

ATTTGTTGTT 

ATATAGGCAC 

ATG AAC AG T A 

CCCCACTTAT 

AGTACTGAAC 

TTATAAATTA 

ACTGTATAAA 

GCAGTAGACC 

CCGAGCCTCA 



TGTGTATAAC 

TTTAGGTGTC 

GTGAATATTC 

TAGGAGAGGA 

AC ATG CGATG 

CCTAATGTTG 

TTTCCAATTA 

TATCTCATTA 

CAGTCGACTT 

TATTACCTCA 

AGTTTCTAGT 

GCTGAACAGA 

TCTGGTGGCT 

TATGGCAGAT 

ATGTGGCCTC 

ACCAGATCTA 

TAAAGTCCCA 

AATTTTGGGG 

AGCTGTCTTT 

CCCGGGTTGG 

AGAAATTCTC 

TTTTTGTATT 

TGACCTCAGG 

ACCAAACCTG 

CCATTTTATG 

GTGCAGTGGC 

CATCTTATCC 

TGTATTTTTT 

CAATCCATCA 

CCAGCCAATG 

CACTGCCCTT 

AGGTCATGAA 

CACTTCTAAC 

AATTTGCTAA 

TATAGGATAT 

AAAAAGATTT 

TGTGCACATA 

GGGGAAACAA 

TATTATGGAT 

AACTCCCAGT 

AGTTAAGATG 

TATATTTTTT 

GGCAGAGACT 

GAGAAGCTAG 

AATGATACCT 

TTAAGCCACC 

AATCCAGGCA 

TTACTTTCCC 

GCACAAAACA 

TCTATATAGC 

GGCACAGTAA 

AGTTAGGTGA 

ACAGGAACTA 

GTGTGCAGTT 



TTTAATTCAT 

TACTGTTGAT 

CTGTGTATGT 

ATTGTTGGGT 

AAATGAAAAT 

ATGGAATCTA 

AAATTGTTAA 

TGGTTTTATT 

GGGCTGCCAT 

CAGTTCTAAA 

GAGGGCTCTC 

ATACCAGAGA 

ATAAAGCCTA 

GTGAGATGTC 

TTCATATGCC 

TCAGACTACT 

AAATCCCTAT 

GAACACAATT 

TTGTCCTTTT 

AGTGCAGTGG 

CTCCTCCCAA 

TTTGATAGAG 

TGATCCACCT 

GCCTGTCTTT 

TGTTGCAGGT 

ACAATCACAG 

TCCTTAGTAG 

TGTAGAGATG 

ACCTTGGCCT 

ATATCTCATG 

TTATGCACAA 

GATCTTATTT 

ACCAATTATA 

AAGGACTCAC 

ATAGCTTATT 

TAG C ATTTG A 

TATACCAAGA 

CTCAAGGAAT 

CGAATTGTAT 

ACCTCAGAAT 

AGGTTATAGT 

AATAGAACTA 

GCGGTTATGC 

GAAGAGTCAA 

TCATGTCAGA 

CTAGCTTCTA 

AGTTCCAAAT 

AG CATTAATG 

TATGTTCCAG 

TGTTTTTTCC 

GAGATTAATA 

ATGTTTATTT 

AAACCATGTA 

TTAATGGCCT 
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GCCATGGTTG 

CTGTGTGGCT 

C C AGGTAAAC 

CAGTAGCTGA 

TTTATTTTGC 

TTTTTTGTGT 

TTCTGTTTAA 

TTGTTGTATG 

TTCGTTTGTC 

TTTAATAAAG 

CCTGTTTGTC 

TGTAGATCAG 

TACTCCTCCA 

TCGTAGAAGT 

AC CAGCACTT 

CTGGCCAACA 

GCGGGCACCT 

GAGGCAGAGG 

GAGACTTCGT 

AATAACTAAA 

CCTCCATATA 

GGCTGGGAAT 

GCTTCATGCC 

CAAGGGGCTC 

GGACACCTCT 

GGAACTCACA 

GGTTGCAGAC 

GTGAGAACAT 

CTTATCTGAA 

AGCTTCATGA 

GTTGTCATTC 

GGATCATTTA 

TTACATATTA 

AGTGGATGGA 

GATGTACAAT 

GAAAATTAGG 

CTAGGGTTTT 

AGATCATGTG 

ACCAATACCC 

GATAGTAGAA 

AC AT TTAAAT 

AGAAGCTCCC 

GAGAAGGATC 

AACCTTCTTC 

CAGCTGGGAG 

GGGAAGAAAA 

CCAGGAAAGG 

AAATACTTGA 

TGAGGAAGAG 

GAGGAGGAGG 

CAGCCCCAAG 

GCTTCTCCCT 

TGTAAGGAGA 

CAAGGAAGAT 



ACTGCTCACA 

CAAAGTTCCT 

AAAGACACAC 

GGGCAAAGGC 

TTTTCATGTT 

ATAGTGTGAA 

AAGGCTATAC 

GTGCCACTTT 

TATTTTTGGA 

CAACAGATTT 

AATTTCTATA 

TTTGGGGACA 

TTATTTAGTT 

CAGATACGTA 

TGGGAGGCCG 

TGGTGAAACC 

GTAATCC C AG 

TTGCAGTGAG 

CTCAAAAAAA 

AAATGAACAG 

TACAAAATGG 

CTCCCTGAAA 

CAGATATGGG 

CTTCCTCAGA 

TAAGGAAGTT 

GCACAACTGC 

CAAGTCCGTG 

GACCTCCCTG 

ATTCCTTTTC 

GCATTTATTG 

ACTATCCACC 

TGCATGTATT 

CTTTTGCTAC 

CTGAGGCAGA 

TAGAGCAAGA 

AGGTAGAATC 

TAACTGGAGC 

TTGTCTCAAA 

ATGGTGCAGG 

AATCATGAGT 

TGGTAACTCT 

ACACTTTTGT 

CTCTTGTGGC 

AAAACAAAGG 

AAGGTAATTC 

TTAGTCAACA 

AGCAAAGGTC 

GGAAGAGGAG 

GAGGAGGAGA 

AGAAGGAGGA 

ATATAGGCAT 

AACACCATCA 

GACAAAAGTT 

AACTAAAAAG 



TGGCCGATCT TTTAGTCTAC 
ATTATAAATC ACATTGTTGA 
TTATCAGTGA GAACATTTCA 
TAGACCTCTT TTTGGGTAAG 
TAACTTTATT TTGCTTTTCA 
GTAGGGGGTC AAGTTTCTTT 
AATTGTCCCA TGCCATTTAT 
AGATGTAAAT CAATGTCCAT 
CAACACTGCC CTGATTATTG 
GTTTATTTTG GGCCCTTGGA 
ATAAAGCTTA TTGGGAATCT 
ATTAATACCT TTAAAATATT 
TTCCTGTTTA ATTTATCTGA 
GAAAATTCAA AGCCCAAGTG 
ATGTGGGTGG ATCACCTGAG 
TCATCTCTAG TAAAAATACA 
CTAATCAGGA GACTGAGGCA 
CCAAGTTCCT GTCACTGCAC 
CAAAAAAAAG AACATTCAAA 
TTATTAAAAT ATCAGGATAT 
C CAGTTAG AG AAAAAAAAAA 
ATCTTTGAGA GCCTTGGCCC 
TACAGTTCCT TGTTTAAAAA 
GCACAAGGAC CTCCATAACA 
AGACTTCCAA AGAATGGTGT 
TCCTTGGAGT TCGGTTTCAA 
GCCTCAGTGT CCGGATGTAC 
AAACCACCAC AAGTATTGTT 
TTTAAAAATT CAAATTACAT 
AACCCACAGC TTTTAAAACC 
AATTATTTAA TTATTGATCA 
TCTTTTATAT TGCATATTTT 
AGTAATAGTT CAGAAGTGTA 
ACTGGAGGCA AGAAAATGTC 
GAGTAGCACT GAAATTGAAG 
AACAGATTAG ATGTAGGGAT 
AAGTAGGTAG ACAGAACATT 
GGGCATGAAG AGTAGAAAGC 
GAGAGGGAGG GAGATCTGCT 
GTGTGATGTC CTGGAAGTTG 
GACAAAACCT GGAGGCCAAC 
GGGCATCAGA AAGCCCACCA 
TTTGGCAGGG AGAGAAGAAT 
CCTACTCTCA AGGGGAAAAC 
TTCCCACTGC AGCCTCATCT 
GGGATCAGAG CTTCATGAAA 
TGAGGAGGAG GAGAAGGAGG 
GAGAAGGAGG AGGAGGAGGA 
AGGAGGAGGA GGAGTTGTAT 
GGAGGAGGAG TTGTATCATT 
GCTAATAAAC TGAGGCTTAA 
AGGCTCCAAC TGAATAACAA 
AGAATGAGAC AAGTATTGTT 
GCACTCTGGA TTTAGAAATA 



CTCCACAGGT 

CTGTGTGGTG 

AGGGTCTAAA 

ATAAATTTTT 

TGTTAGTTCC 

TTTTTTCCTT 

TTACAAGAGT 

ATTTGTTTGA 

TCATTTTATC 

TTTGTGTATT 

GATTAGGATT 

GACCGCTTCA 

GTAATACATT 

CAATAGCTCA 

GTCAGGAGTT 

AAAATTAGCT 

GGAGAATCGC 

CCCACCCTGG 

TAATCAATGT 

AAAAGCAAAA 

GAATAGGCGA 

TGCCCTCAGG 

AATTTTGCTC 

CCGGACACTA 

TTCCTCTGTC 

ATCTACAAGG 

GGTGGCCTTG 

TCATGTTATG 

ATTTTTCAAG 

TACTGAACAC 

ATATTGTTTC 

ATATTTCTGC 

CATCCAAAAT 

ACAGTAATTC 

AAAAATAGAT 

GAGAAGGGTC 

TCTTCCTGAA 

CTGGGACAGA 

AAAAAGACTG 

AGACAGTATC 

TGTGAATGCC 

GGTTCCTGCA 

TATTATGAAA 

ATTTTGCCAG 

AGGCTTTCTG 

ATAAATTGGA 

AAGAGGAGTT 

GTTGTATCAT 

CATTATAAAC 

ATAAACACTT 

CACTTTGACT 

TGAATTATGA 

ATCTAGAGAT 

GGAAGTCATT 



AGAGCTGATA 

GTCAAAACCT 

ATTCATCTCC 

TACCATATAC 

CCTGGAATTG 

TTTGTTCTTT 

CCTTTCACCA 

GCCTGTTCCA 

AGTTTTGATA 

AAATTTGAAC 

ACAATGGTTT 

ACTGTAAATA 

ATAGTTTTCT 

TGTCTGTAAT 

TGAGACCAGA 

GGGTGTGGTG 

TTGAACCCAG 

GCGACAGAGC 

AGATAATTCA 

AAATCAATAA 

GACTTAAAAA 

GATTTCTCTG 

CATCAATCAA 

GATGTCTAAG 

CCCAAACTCT 

CTGTCATGGA 

GCACCTGAAT 

TATGTTTTTT 

CCCCTGAACA 

TTTGCTCTAT 

CTTAGTGTTG 

ATTACAGTTA 

TTAGCTGTGA 

TAAAAAAGAT 

GCGTTTGAGA 

AAAGATGACA 

AGGGCAGGTC 

TCCTGAGATG 

CAAATGTCAG 

ACATTTGAGA 

CATGAGAGTG 

GTGAAGATCT 

TACACCCCAG 

AGTCTTATCC 

TCTCACTTAA 

AATGGTGCAG 

GTATCATTAT 

TATAAACACT 

ACTTGAGGAA 

GTGACGGTCC 

ACAGAATGCT 

ATGAAAGAGC 

GCCAAGAAGG 

AGTGACCTTG 
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TAAATAATGG 
TCAGAGGTCA 
CAGCTGCATG 
CTGAGCTATG 
AAAAACAAGT 
TGTCTTGATA 
TGTTCTGCCT 
AGCTGGGCAG 
ATGTAGATGG 
GGTGGGCAGC 
CCATTGCCTC 
CACTGAAATT 
AAATGGAAAA 
CAGGTTTAGG 
GGAGTGTATG 
TTTT AG GAG C 
ACTGCCAGGT 
CACATCCTTC 
TGGCTAGCAA 
GTGACACTGG 
TTAGCCCTGT 
GTTTAACTTT 
GTATAAAAGA 
TCATTACACA 
TGTGTCCATT 
GAGGTTTTAC 
AAGTAGGTGC 
AGAGCTGCCA 
CTCTTCAATA 
TATGTAAAGT 
TCAGTCAGGT 
TTTTTCTTCA 
AAAAAGTTTA 
AGGGAGTTGG 
GGAAAGGAAA 
ATATTCACAT 
TATGGCCATT 
AAGTAACTAA 
ATCTGGCTAT 
AATCTGGGCC 
TAATCTGAGT 
AAG CCTTAGA 
CCCCATCTGA 
TCTTAGTTCC 
TTAATGGGCT 
AGGGCCATTC 
C TGTATTTG A 
TTGGGAGGCT 
GTAACATAGC 
ATAAAAGCAA 
CCTTTACATA 
TTTTAATCCC 
TCCAGCCCAC 
TGCACCTAGT 



AGCCAGAGGA 
GGAGGTGTAA 
GAGATTTGGG 
TGTAAATAGA 
GCACAGTTAT 
GAAGAATCCT 
GTCTCTGACT 
CCCTGCCCTC 
CAAAGGAGAG 
TCTTCCTCCA 
TTCAGATTTA 
GAACAAAGAA 
GTGGAACAAA 
AGAACAAGGA 
CTGGACCACT 
ATGGAAGATC 
TGAAGAACCC 
CAGCCCTCTA 
AGCGGTATTG 
ATCACTACTG 
CACAACTTTG 
TTCTGTTGGA 
TTATCTTTTA 
ATCTATACAT 
AAAAATAAAA 
CACGATGGCT 
TTATAGGGGT 
AATAGC ATG C 
TCTACACTTA 
GGAAACAGAA 
AAAATCTGTG 
GTCATGCTCA 
CCCATAATCG 
AGACACAGAA 
GAAGTGGCAA 
GTTAAAG CTA 
ATTTTCCACA 
GAGCAAAAAT 
AATCTAAGTG 
TTCGCAACAC 
AGTTACGAGT 
GCTTGGGCGC 
TAGTCTGATA 
TCCTGAAATG 
TGCCTTGTGG 
CATCTTC CAG 
AAAGAAAGAA 
GAGGTGGGAG 
AAGACCCTGT 
AAAGAAAGAG 
GTTCAGGATC 
CATCATCTCT 
CATTGTCTCT 
GCACCTAGAG 



ATACCAAGGG 
CTGACTCTCC 
AGAGGGAAAG 
ATAAGACAGG 
CTAAGGGAAA 
TGATCTGGTT 
TGCTCTGTGC 
TTGCAACAGC 
AGAGGTTAGT 
CAGTCCCAGC 
AAGGTGTGTC 
TTTTATGGGA 
CGTATCAGAG 
AAAGATTCTT 
GATGATGATA 
TCCAAACTGA 
ATTCTGTGAG 
GTCTTCCTCC 
GAAAAGATAG 
TTGCTGATCT 
TAGATATCCC 
ATCCTAATAT 
CCAAAAAAAA 
GTATCAAAAT 
ATTAAAGAAA 
GTTATTTCCC 
ATAGAGGGGC 
AGGTCCCATG 
AGCCTAACAC 
CCAATCTGGC 
GATATAAATT 
ACGATGCTTC 
AGCTGTGTCT 
ACAGTGTTTG 
TAGGAAGGAA 
ATTCAGTTTT 
AC C ACACTAA 
ATCCACAACT 
ACCCTCCACT 
CATGAACTGT 
CCTGAAGCTA 
TGGCGGGTCC 
ACTAGAGAAG 
TTCATATTTA 
TCCATACCAC 
GCAGTAGAGT 
TCATTTGGGT 
GATCATTTGA 
CTTTAGAAAA 
TC CATC TTAG 
AATTATAATA 
CTGAGTTTCC 
CCTCCTATAG 
TCTACTCCAG 



CAGAAGCCTC 
CACAGTGTGG 
CTTTTTTTTT 
AAG AGTG TAG 
CAATGGGATC 
TATTCAGTGT 
CCCAGAAGCC 
TGGATTTGGC 
GTACTTATTC 
TCTGGCCTAG 
TGTCAGGGTA 
ATGGTTGTTA 
ATAGTAATGA 
TGAAGAGATC 
TGTCTGTAGA 
AGCCAACTGC 
GATGTCAACA 
AGTGCTTTCT 
AAGAGACTAA 
TGGGCTGCCT 
TTCATTATAT 
GGCACTCCTC 
GACAAAAAAC 
ATCACATAGT 
AGATGGTAAA 
CCATGAAGGG 
TCAAAGCTTT 
GGGGCAGAGC 
AAAGTGTGTG 
AAACTTTGTA 
TATATTGATC 
AGCCATGCTC 
GTGTCTGAAT 
AAGTAATGGG 
CAGAGATCTG 
CAATCATCAT 
AACTTTATTA 
ACCATTTGAG 
GAATGTCAGT 
TCTTGTCTTG 
GAAAGATGGA 
TGTCTCACCG 
CCGGCCAACT 
GAAATTATTG 
TGAGTGCAGA 
TCAGTACTTC 
GTGGTAGCTC 
TGCCAGGAGG 
AAAAAATACA 
GGACAGACTG 
AAACACTTTT 
AGTCAGTTTC 
CTCCACCAAC 
AATGCTCATG 



ACTATAGTGT 
CTTTGGAAGA 
TTTTTTTTAA 
ACACAGGAAA 
AAGCTGCAAG 
TTGGTCCAAA 
CAGCTTCTAC 
CAGTGATCAG 
CCTGCATCAC 
CTCTGGTTAC 
TAACTGGGAG 
ACTAGTTATA 
CAGAAAGCAA 
CCCAGAACTG 
TAGAGGCATG 
TGTTACTGGA 
AACAAAGTGG 
ATTGGTAGGG 
ATCTTCATAA 
CATATCCCCT 
GCCCTTCATA 
CATTTTTCAG 
TGATCTAATT 
ACCCCATAAA 
TATAGCTCTG 
GGGAGTGAGG 
GAGAGAGGAG 
CTCTGCTCAT 
CTTAATAAGT 
GGACTGGTGG 
AAAAAATTCA 
AACTCTTCTG 
AATGAAAAGA 
TAATGGAAGC 
TGGTCCTATG 
TAAAATTTTG 
CCTCTGGCAA 
CTATCAATTT 
ATCTTTGCAT 
AATATCCAGA 
AACCCCATTT 
GGACAGAGGG 
TATTCTCCAA 
TTTGTCAGTA 
GCTTGCCTGG 
TTTAAAATTG 
ACACCTGTAA 
ACCACTTGAG 
ATAAAATAAA 
TAACTACTCA 
GTGCAGATTC 
TCTGCATGTA 
AAATCAGAAC 
GAGAAAGTTT 



GTTGCACCTG 
GAGAAGTCAG 
TTGGAAAAGA 
GAGGGCAGAC 
TATATAAACT 
CCCACATCCC 
AGATAGCATT 
CCCAGCAGGA 
CCCCCTGCTT 
AGGTTCCCTC 
CTAGAAATTG 
AGAGGACTGA 
CTACCACCTC 
GGACCTCTGA 
ATGAGGCTGA 
TTCAACTGCC 
GAAATCTTTT 
TTTGGGGAGG 
CCAGCACAGG 
GTTCTTCCCA 
TATTCTTTTG 
G AC C AAAAG A 
CCTGATTTGA 
TATATACAAC 
TCAGGCAGTG 
GAGCAGCTGA 
AATGTCTGAA 
TCACCAGTGC 
ATTTGCTGAG 
GCAATGAAGA 
AGGTTAGGTG 
TAGCCACAGA 
C CATGATGCA 
ATGCTACCAG 
TCCCCTGAGC 
TTCCTAAATA 
GTGACTATGC 
AGGGAAAGTC 
ATGTGATTTA 
TTGAAGGAAA 
GCTCATCAGA 
GCTCTTTCCT 
GAAGGAGCCA 
ATTTAACCCC 
AAGAATTGTG 
CTGCTGAACT 
TCCTAGCGCT 
ACCACCCTGG 
TACAATAAAA 
CTGGAGCTTA 
AATAGGATTA 
GACACCCTTC 
TTTTTCTAAC 
CTGAAAGGTA 
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AAACTCTGAA 
GTAGACTTCA 
AGAGTATGGA 
GTAAGGATTT 
CAGGCTGACC 
TCTACTGTAA 
TTCAGAACAA 
GAGTATATTT 
TTAACCATTT 
TTTATTTATT 
GTGGCGTGAT 
TAGCCTCCTG 
TTTTAGTAAA 
GATCTGCCCA 
TTCAGATTTG 
ATATTTTGGC 
CCTTCTGAAT 
AGCATATTTA 
CTTGTAACTT 
GATTGCTTAA 
TTTTTTCCCC 
CTATGGTGGC 
TTAAATTTTA 
CAGGCTGGAA 
CCATTCTCCT 
GCTAATTTTT 
ATCTCCTGAC 
AGCCACCGCG 
ACGTATGTTT 
GTTTATGACA 
CTACTGACAG 
TGTTTCTGTC 
TCCTCCCCCT 
GGTTTCCTCT 
ATAGAAATGG 
CTACAAAAAC 
GCCTGTAATC 
GAGACCAGCC 
GGCGTGGTGG 
CGAGCCTTGG 
ACAAAGTGAG 
AGGATTCCGT 
CCACACCACA 
AAGAATTTAG 
AAAGGGACAG 
GTCCCAATGG 
GATCGTTGAC 
CACCTGCTTA 
TACCTGTAAA 
ATTACAATAA 
AATAACTATA 
CTTATTGAGT 
TGTGTAAATA 
TTGGTTACCT 



TGATATTTGT 
GTAGAAGAGG 
AACCCAATGG 
CTCAATGTGT 
ACACAATTAA 
GCATTTAGAA 
AATTATGAAG 
TGCATATATT 
TTACTAGATT 
TATTTTTTTT 
CTTGGCTTAC 
AGCAGCTGGG 
GACGGGGTTT 
CCTTGGCCTC 
TCAGTACGAC 
TGCCCTAATT 
TGGTTTATTA 
TTCTGTTAGA 
TATCTCATAA 
AATTTTATTA 
CAAAATGCAT 
TAGATTTGTA 
CTGACTTTTT 
TGCAGTGGCA 
GCCTCAACCT 
TGTATTTTTA 
CTCATGATCT 
CCCCGCCTAC 
CTTGCTCTTG 
TTGTTTAAGT 
CTCACACATC 
TCCCAGCTAC 
TTCTTCTATA 
ATAGTAGAAT 
GGAAAAGGAA 
CCAGCAGGAA 
CCAGTACTTT 
TGGTCAACAT 
CATGCACCTG 
AGGTGGAGGT 
ACGCCATCTC 
CAAGGCCTGG 
CCGTATCCCA 
AGTCACAGAG 
AATGTAAAGC 
TCTCCTGGTG 
CCAGTTAGCA 
GGTTTTATCT 
GCTCCTTAAA 
AATGACTCAT 
AAATATGGTA 
TTGCCTAATA 
TGTGTAACAC 
TTTCTAGTTC 



AGCTAAAGGG 

AATGACACTG 

GAAAAATGCT. 

TCCCAAATTT 

AGAAGG CTGA 

AAAGAAAGTT 

AGCTATATTT 

GTTATAGTAC 

ACTCTGTGTT 

TTGAGTCAGA 

TGCAACTTCC 

ATTACAGGTG 

CACCATGTTG 

CCAAAGTGCT 

TGTATTTACA 

AATTTTACAA 

AAATATATTC 

ATTTCATAAT 

CTTTATGCAG 

TAAACAAGTG 

GTTTCCATTT 

GACTAAATAT 

TTTTTTTTCT 

CAATCTCGGC 

CCCGAGTAGC 

GTAGAGACAG 

GCCCACCTCA 

TGACTTTTAT 

TTATTAAATA 

TATTACATAA 

ATCAGTTGAA 

TAAAAGAAAC 

AATCTGATTC 

AGAGATCCTA 

GGTCTGCAAG 

TTCCAGAAGA 

GGGAGGCCGA 

GGCGAAACCT 

TAGTCCCAGC 

TGGTGAGCCG 

AATCAATCAG 

GGCACACAGG 

CAGGCACTGC 

GAGGAGGCAC 

TTCAGCTGAT 

ATGTAGGTTC 

TTCAAGCAAC 

GTCCAGGCAA 

GATGATGCCT 

GCAAGACACC 

AGTTTATAAA 

ACCACACAGC 

AAACTTGTAG 

ATTATGTAAG 



AGACTTGCTA 
CAATGTCAGG 
CACCAGGAAC 
GCCCAGCAGA 
ACCTGGGGGC 
ATCCATTCAA 
ACTTTTCTAC 
ATATTCAATT 
CATAATAATC 
GTCACACTCT 
ACCTCCTGGA 
TGCACCACCA 
GTCAGGCTGG 
GGGATAATCA 
CTCATTTGTT 
TTAATATGAT 
TAATGTAATT 
TTATAAAGCT 
TTACAAGTAG 
TAAAAAACAA 
TAACAGAACC 
TAAAAGTCCC 
TTTTCTGAGA 
TCACTGCAAC 
TGGGACCACA 
GGTTTCACCG 
GCCTCCCAAA 
CCAAAGAAAA 
TGACACATTT 
TTAATTCATA 
CAGCAGAAAG 
GAAACCCTTC 
CATGTTAGTG 
TGGCAAAAGT 
AGCCAATGTG 
AAACTCCTCA 
GGTGGGCAGA 
CATCTCTACA 
TACTCAAGAG 
AGATCACGCC 
TCTCCTCGAA 
AAAATATTAA 
GGATGTGCAT 
CAAGCAGACT 
TATCTGGCCT 
TTAGGTTTCT 
TTCCACCCTG 
TAATAATAAA 
TCTAACTCCT 
CAGGTAGTTT 
AGTTACATTG 
ACAATAATAA 
AAGGTATATC 
TGGCATAGCT 



GAGACAATAA 
GTGCAGGACT 
ATGAAGAGAA 
GGGAGGCCTC 
TTTTAACAAC 
AAATATATAT 
ATTCTAATTT 
TTGTATTTTG 
ACTTTTTTAA 
GTCGCCCAGG 
TTCAAGCAGT 
AGCCCGGCTA 
TCTCCAACTC 
CTTTTTATGC 
TTATTAGAAA 
TTTGAAATTG 
TATGACATTT 
ACAAACTGTA 
AAATAAAATG 
AATCACTAAA 
CGTATTTAAT 
AAAGCAAATG 
CGGAGTCTTG 
CTCCGCCTCC 
GGCGCCCGCC 
TGTTAGCCGG 
GTGCTAGGAT 
TATAAGAGCT 
AGACTTAAAC 
AAGATAATGA 
CTTATTAAGC 
CAGGTGTTAA 
AAATTTCTAC 
CATGTCTGAC 
GGAAATGGGG 
GGACGGGCAC 
TCACTTGAGT 
AAAAATAAAA 
AC TTAAGTGG 
ACTGCATTCC 
AAGCAACATT 
GGCAGAAGAG 
ATGCAAGAGG 
GTGGAGAAAG 
CAGGGATTCC 
TTTACAGGGG 
CACTTTTATT 
ATTATTGAGC 
CATTCAACAG 
ATAGCAGCTA 
AGTATACTTT 
TATGTATATA 
TGAGTACAAC 
ACCTAAGGAC 



GCTAATAGTT 

TCAAGAGGGC 

GGAATTACGT 

GGGTTGATGG 

CATCGTGGGC 

ATTTTTAAAC 

TTATAAATCT 

CTGTTTTCAC 

AACTTTTATT 

CTGGAGTGCA 

TCTCCTGCCT 

ATTTTTGTAT 

CTGACCTCAT 

TGCATAATTC 

GAATTCCAGA 

GGTATTGGCT 

TCATCATATT 

TGTGATATAG 

TTCCCCTCAA 

ACACTCCCTC 

CAGCAGATTT 

CATTTTTCTC 

CTCTGTCGCC 

CGGATTCACG 

ACCACGCCCA 

GATGGTCTCG 

CACAGGCATG 

CTTCATCATA 

TGATTTGAAG 

CTAGTTTGAA 

TACTTTCTTA 

GGCAAAACTT 

TGATGGCTTT 

ATGGTAGCAA 

AGAGGACTGA 

ATTGGCTCAT 

CCAGGAGTTT 

AAATTTGTCA 

GAGAATCACT 

AGCCTGGGCG 

ATGGAGAGAC 

AGTTTCCTCC 

GGTTGATCCT 

TCATGACCAG 

AGAGGAACTG 

TTTTCTGGGA 

CTTTCCCCTT 

CCTGGACATG 

ATACAAAAAC 

ATAAAAACAG 

ATAAGAACTG 

TTTTTAAATA 

CCTATTCTGT 

TTATGCTTAT 
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AAATGTTACT 

GGTAGGTTGA 

GAGAGACTTG 

CAGGGGACCG 

ATTGATCATT 

GGAGAGAAGG 

GAATTAAGTG 

GTATTGCTGC 

TGGAATATAC 

AGATGCCTTC 

TCAGCACAGA 

AGGCCACATT 

CAGAGGTCCC 

GGAGATGACT 

ACAGCCCTTA 

TGGGGCTAGG 

AAATGGAGTC 

CTCTTTTCCC 

ATTGGGAACA 

CTTACAACTG 

TAGTGGGACC 

TCTGCTGGAC 

GTGTAGCTTG 

ACAAAACAAA 

AGCTACTCAG 

AGCCATGATA 

AAAAACAAAA 

CCCTCTACTC 

ATATTAAAGG 

TACTATAAAC 

ACTTTCTTCA 

CTTTTGTGAC 

CTGGTGGATG 

TGCCTGCCTT 

TGCTGCTCTT 

CCTTTCCTTA 

TTGATTGCTG 

AGAAGACGTC 

AGCTCCTTTC 

GATGGTCTCT 

GGTGGTTATG 

GAACTCACAC 

AGGAATACGA 

GCTTGGCCCA 

ATTGGGTTGG 

CAACTGATAC 

AGAAATGTGA 

TGAAATCCTG 

AAG C C AG AAA 

TAAGTAAATA 

AATGGTAGGG 

AGCAATCAGT 

AATTTCAAAG 

CTTGAAATAA 



CAAAAAAATA 

AGGGTTGGGC 

GAAAAGAAAG 

GTGTTCAGCA 

ATTGGGTGTT 

TCAGCAGGTA 

CTGTGCTTTA 

CAGCATGTCC 

AATCGGGTTT 

CTCTTGTCTC 

CCCTTTACGG 

TCAGACTATC 

TGTGGCCTTC 

CTTAACGAGC 

ATCCATTTAA 

GTTAGATTAA 

TCCTATGTCT 

CACAGGAGGT 

AGCTCTGTTT 

AAGCCCATGG 

TAGGGCACAC 

ACAGTGGCTC 

AG CCCAGGAG 

CAAACAAAAA 

AGGCTGAGGT 

ATGCCACTGC 

ACACACCATA 

ACTACTAAAT 

TCTTAGGCTA 

TAAGTACTGT 

TCAACAAAAT 

TGTAGTTGGC 

CTGGCATGCC 

TGCAGCACCG 

AGGCTGCTCT 

TTTACCCATG 

GGAGAATAGA 

TCTGCAAATC 

ATCCTGTGGC 

AATCCAAGGA 

GACCAGTTAT 

AGCCATAGAC 

CTGTAGCAAA 

GGCCTGTCTC 

TTGGGATATT 

AATGATGTAC 

TAACTAAGGT 

TCATTTGTAG 

CAGAAAGTTA 

AGTTTATCTC 

GAAAGAGATG 

TCTAGTGTTC 

AG CTAGAAAA 

TGGATATTCT 



CAGAGGACAT 

TGCCCCTCCA 

AGACACAGAG 

TACGGAGGAT 

TCTCGGAGAG 

AACACGTGAA 

G ATATGC AT A 

CACCTCCAGC 

TACACTGAGA 

AACTGCAAAG 

GTGTCGGGCT 

ACATGGGGAG 

CTCAGTGTTT 

ATGCTGCCTT 

CCCTGAGTTG 

CAGCATCTCA 

ACTTCTTTCT 

GATGGCCGGA 

AAAAGGAGAC 

AAGACAAATG 

CAGAGAGCAT 

ACACCTTAAT 

TTCGAGACCA 

ACAAAATTAG 

GGGAGGATCG 

ATCTCAGCCT 

CCCAACCACA 

AGGTGAGTTC 

GTGACTCATT 

GCTAGGTACA 

GGAGATGTTT 

AGAGCTTTTT 

CAAAGGATCC 

CTCTGCTCTT 

GCTTGTTTTG 

ACCAAGGTAT 

AGAAGTGGAT 

CATTTAGAGA 

TTGGCCATCT 

GCCTGAAGAG 

GGAAGAATAC 

ACTGACAAGT 

TATTTAACAG 

CCTTTCCTGC 

AAGACAATAA 

AAGTCAGATG 

AATTTTTGTT 

CAACATGGAT 

AACACCACAT 

ATTGAAGTAA 

ATAAAGAGAG 

TATTTGTACT 

GAGGACATTG 

AATTAATTAC 



ATGTGGATAG 

CACCTGTGGG 

ACAAAGTATA 

CCCACCGGCC 

GGGGATGTGG 

CAAAGGTCTC 

CACATAAACA 

CCTAAGGCAG 

CATTCCATTG 

AGGCGTTCCT 

GGGGGACGGT 

AAACCTTGGA 

TGTGTCCCTG 

CAAGCATTTC 

ACACAGCATA 

AGGCAGAAGA 

ACACAGACAC 

AGAACATGGC 

TTGTGAACAG 

TGTACTGCGT 

ATTAACTCTC 

CCTACAACTT 

ACCTGGGCAA 

CCAGGCACGG 

CTTGAGCCCC 

GGGCAACAGA 

ATGCATCTGT 

CCAATCCCTG 

CACTCATTAA 

AAAGCAAATA 

TAGGCATCTA 

ATCAGTTTCT 

ATCCTGATGG 

CTGCAGGACT 

ATCTGCTTTG 

TATGAGATTC 

TACAAGTGGA 

GACCTTTCTC 

TCAGCACATG 

AAAAAAAGGC 

ACATCACTTT 

AGGACTTAAC 

CTTCAAACAC 

CATGTCACAG 

TGAACCAATA 

ATTCTGATGA 

TTGGCAAATT 

GGAATTGCAG 

GTTCTCACTT 

AAAGTACAAC 

ATTCATTAAA 

ACAGAATGGC 

AATGTTTCCA 

CCTGATCTGA 



ATAATGGAAG 

TGTTTCTCGT 

GAGAAAGAAA 

TCTGAGTTCC 

CAGGGTCAAA 

TGCATCATAA 

TCTCAATGAC 

TTTTCCCCTA 

CCCAGGGACG 

TCCTCTTTTA 

CAGGTCTTTC 

CAATACCTGG 

AGTACTTGAG 

TTTAACAAAG 

TGTCTCAGGG 

ATTTTTCTTA 

AGTAACAATG 

AGAGGGCAAA 

CAAAGAGTAG 

GAGTTTTAAG 

AAACTTTTAA 

TGGGAGGCCG 

CATGGCAAAA 

TGATGCGTAC 

GGGAGGTTAA 

GGGAGAACCT 

CTTAAGTACC 

GTAGCAGGTT 

ACAAATACTT 

ATCTAAGCTC 

CTCATCATTC 

CTAAATAGCT 

CCCTGTCTGC 

TCCCTTATCC 

CATCACATGT 

TGGAATTTCC 

ACTTAGAAGG 

CAGTGGTGAC 

GCTCCCAAGG 

ATGGAGTATT 

TGCCCACCTT 

AAGAATCTAA 

AGGTGCATTG 

GGGCCAGCAT 

CAACATCTTG 

TTATGAATTA 

TTTGTTTGTT 

GATACTACAT 

ATATGCAGAA 

AGAGATTACT 

ATAAGTTACA 

AATAG TTAAC 

ACACAAAGAA 

TCACTATACA 



AGATAAGATA 

TAGGTGGAAT 

AAAAGGGGTC 

CTTAGTATTT 

GGATAATAGT 

ACAAGGTAAA 

TTGAAGAGCA 

TCTCAGTAGA 

AGCAGGAGAC 

CTAATCCTCC 

CCTTCCCACG 

CTTTCCTAGG 

ATTAGGGAGT 

CACATCTTGC 

AGCACAGGGT 

GTACAGAACA 

TGATCTCTCT 

ACAAAACAGC 

AAAGGGTTCT 

GCAATAGGAG 

AAACATTATA 

AGGCGGGCGG 

TCCCGTCCCT 

CTGTGGTCCC 

GGCTGCAGTG 

GTCTCAAAAC 

AGTAC C AC AC 

TAAG CATGTT 

ATTGTGCATC 

TATAAACTTT 

TGAGCTCCAT 

CTACCAGTCC 

TTACCTTACC 

TTTGGGGTCT 

ATGTAAAGGT 

CCAAACCACA 

GGAGTATTCG 

TCAAAGATGC 

ATGTCCTCAG 

GTGAGTGGTA 

CTACTAACCA 

TTTTGAGTCT 

CTATCACTAT 

TTATGTCTAG 

AGCATAAAAC 

TGTCAATAAA 

CATGACAGGA 

TAAGTGAAAT 

GCTAGCTAAC 

AGAGGCTGGG 

GCTAGATAAG 

AGTAATAAAT 

ATGAGAAATG 

CAGTATGTAT 
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AAAAATAACA 
GCCAAGGTAA 
AACTCCATCC 
CCAGCTACTC 
TGAGCCGAAA 
AATAAATAAA 
ATACATATAA 
AAATATGAAT 
AACTATTCAA 
ATTGGTATAA 
CACACATCTA 
ATCGTCTTTT 
GGAGCCTTAC 
AAAAGCTAAA 
TAGCAAAGAT 
AATGATAAAT 
CAGGAGGCTG 
ATGGTGCCAC 
ATAAATAAAT 
ACATGTATCT 
ATGACAAGCC 
AAATGGCCAG 
TCAAAACCAC 
ATCCCCAGGG 
CATTTTATGA 
CAAATGAGGA 
AAGTCGTGAC 
GGCAGGGAAA 
CTGAGGCTTG 
GTTCAGGGCG 
CACGGCGCTT 
GGTGGCTCGA 
TGGGGGTGGG 
GGCCAGAGAA 
AGAGCTGAAA 
TCTGGAG CAG 
TCTCCATTAA 
AAAAGCTCTC 
AAAATTGAGT 
ATCATGCCAC 
CTTTGCAATC 
TTGTGCCTGA 
GGAAGGCCCA 
AAATTATAGC 
TTAATGGACC 
AAAATAGTCC 
GACAAAGGGG 
GGATGGGGAC 
TTCTCTATCA 
TCCATCACAG 
GGCCTGGTGT 
TAGACATTAG 
GATTATTTTC 
GTAATTAAAT 



CTATGGGCTG 
GCAGATCACT 
CTACTAAAAA 
AGGAGGCTGA 
TCGCGCCACT 
TACATAAATA 
TTATTATGTC 
CAATCCTCTC 
TCAAAATTTT 
GGTTAGACAC 
TGGACAACTG 
CAGTAAATGT 
CTCAAACTTT 
ATTATAAAAC 
TTCTTTAAAA 
TTCATCAAAA 
AGGCATGAGA 
TGCACTCCAG 
AAATAGAAAA 
GACAAAGGAC 
AAAACAAAGA 
TATGCACATG 
AATGAGATAC 
TGAGCAAGGG 
TATAACTGAA 
CGAATGGTTT 
CACGTCCTTT 
GGGTATCTTA 
GAGTCGGCCT 
TCCTCGCGAG 
TCCAGCTGCA 
GTGACTGACG 
GTGCTGAAGT 
ATTGACACCT 
GATACCGCCT 
GCCGGGCATC 
ATTCACATAC 
TAATGACCAA 
TCATGCCTTT 
AGAGATTAAT 
ATATAAATTA 
ACACCTTACA 
GACAAATGGT 
TGTACCACAG 
CAGTGTCCAA 
TGTCCTCAGG 
AAAGAGAAGG 
ACCCGATGCC 
GAAAAACAGA 
CACTTTTCTG 
GAAATAAATA 
GAGTTACAAG 
ATTTTTATTT 
CTAATTGTTA 



GGCGCAGTGG 
TGAGGTCAGG 
TACAAAAATC 
GGCAAGAGAA 
GCACTCCAGC 
AATATTTTTT 
AATTTGAAAC 
CAAGTTGATA 
ATTAAAATGC 
AAAGATCAGT 
GTTTTGACAA 
TTCTTGAACA 
ATGCAAAAAT 
TTCTTTAAAA 
CAAAACAACA 
GTGAAAATTT 
ATCACTGGAA 
CCTGGGTGAC 
GAAAAAGAAA 
TCGCACCTGG 
GTAAAAGTTT 
AAAAGATTTT 
TTCACATTCA 
TGTGGAGGAA 
TTCAGTTTTA 
TTACGCACAA 
CAGAACTTTA 
AATTTCACCA 
TCTGACCACG 
TCGCGCAGCC 
GATCCAGGCG 
GGCCTTGAAC 
CATTCTTGGG 
CGTGACATCC 
TCTGGCTGGT 
TGTATTATCT 
ACGAAAATAA 
GTCCTACACG 
TCTGAAGTAA 
TGGCCCCAGA 
ACTGGGAAAA 
AGAACCAGGG 
GTCACTGGTC 
AAAGGTGAAA 
CACATTAAAG 
GAGTTTAGGT 
AG CGCTGAAG 
CGGAATATGA 
ATTACTCTCC 
GTGATAATTT 
ATAAAATTTT 
GATAACTGTG 
AATTATTTAA 
ATATTTATTA 



CTCACACCTG 
AGTTAGAGAC 
AGCCAGGCGT 
TTGCTTGAAC 
CTGGGTAACA 
AAAAAAAGAA 
ATAATTTTGA 
TACTTAAAAG 
TATAGTAATC 
GAAACAAAAT 
AGGTGTCAAG 
AGTAGACATC 
TAACTCAAAA 
AATAGGAGAA 
GGTTTATAGT 
GCTTTTCAAA 
CCCGGGAGCT 
AAAGTGAGAC 
AATCACAGGC 
AAAATATAAG 
TCAACAGACA 
AAACATCATT 
ACAGAATAGC 
ACTACTCTCA 
TGTATAACTG 
AACATGAGAC 
ACCTGTTTGC 
CAGCCTCAAA 
AGTCCTGCGG 
CGCCCTGCTC 
GCAGCCCAAG 
GCTCCCAGGA 
GCCCCTGGGG 
CTGGAAGAGA 
CCCTCCTCAC 
GGTTATTTAA 
AAATTAAAAA 
ATAGTGAATA 
TAGACGCCCA 
ATTCTTTAGC 
CCTCATTTAG 
ACTATTGCCC 
CACTTTCACC 
ACGTTTCTTT 
CAAGTGCTCA 
CTTGGAGAAA 
ACTGAGGACC 
CAGTTTGGAG 
TAACCAGAAA 
AATGTGTTTT 
AAGAATTAAA 
AATTATAATT 
TAAAACCTAT 
TTATAAATTA 



TAATCCCAGC 
CAGTCTGGCC 
GGTGGCATGT 
CCAGGAGGCG 
GAGCAAGGCT 
CATCACTATG 
AAAATGAAAA 
GAAAAAAGTC 
TGGAAAGTAT 
AGAGAACCCA 
GCTATTTAAT 
CGGTGTGGGG 
TAGACCATAG 
AATCATCAAC 
TTATAAAACA 
AAACATTATA 
ACAGGTTGCA 
TCTTCCTAAA 
TGAGAGAAAA 
GAACCTTATA 
TTTCACAAAA 
AGTTACTAGG 
TAATGTTAAA 
TATATTGTGA 
AATTACGGAT 
ACAAATCTGT 
TGAAGTACGT 
GAGGCCATTT 
CTATGAAAGA 
CAGCTGGGGA 
ATTTGGTCCA 
CCCACATCTG 
GCGGGCATGG 
AGTACGTTCA 
CTACATACTT 
ATATCTGGTT 
AAATTTTAAA 
AATTTTTTTG 
GAGAAGGGAT 
AGACCGTGTA 
TATGTTACAT 
CAATATTATA 
CAGTTGGTAA 
TATAATTTCA 
GGAGTGACAT 
AGAGACCCAA 
CTGCCTGTGG 
GGGCCTGAAG 
AGGTATTTCA 
AAAAAATGTA 
AAATATAAAA 
AGTAATTAAA 
TTAACATTTA 
TTTTAGAATT 



ACTTTGGGAG 
AACATAGTGA 
GCCTGTAATC 
GAGGTTGCAG 
CTGTTTCAAA 
CACCCCATAT 
AATGAAACAC 
CGAGGGCTTA 
TTCAGAATGA 
GAAATAGATT 
AAGTAAAAAA 
GAGAGGAGCA 
ACTTAAATGT 
ACCCTAGGAT 
TAAATAACAA 
AAATGAAAAG 
GTGAGCCAAG 
AAATAAATAA 
TATTTATAAT 
ACTTAGTAAG 
GAAAACATAC 
GAAATGCAAG 
AGGACTGACA 
ATGTAAGAGG 
ATGAGAATCT 
AAGAAATATA 
CAGTAACAAT 
CGTGGATCCG 
GGAAGCCGCG 
CACAGGTGGT 
GCCGCCAAGG 
GAGAGGGAGG 
ACCTGGGTAA 
GTGTCACTCC 
TTCTAATTTG 
ATTTAAAAGC 
AAAAAGAAAC 
TGTGGTCCCT 
CGACTTACCC 
TATGAACGTC 
GCCTAGCGTT 
TTTCAGGAAA 
ATGAAACCAG 
CATACAATCT 
CAAGATGTAA 
GGAGACACAA 
ACTGAAGTGA 
GACTCTTCTA 
ATTTATATTT 
TCACAGTGAT 
ATCTTTTATA 
TTGAAATACT 
ATATTTATCA 
AAAAATAAGT 
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GTAGAAGCGA 
GAGGATTGCT 
CAATACAAAA 
GCTGAGGTGG 
CCACTGCACT 
CTTAAAATTT 
TCCTATAACC 
ATGATAAATG 
ACATATCCAA 
CCAGCCTGGG 
AGGATGATAA 
AGTAATTTTC 
AATATCCAAC 
ATAAAATGTC 
AAGAGAAATT 
TTGTTTTCAA 
GAGCGTGGTG 
TTTAGGCCTG 
AAAAAAAAAA 
CAAAATTTGA 
AAATTAATAC 
TGAAGAAAAG 
TATTTTTAGA 
AG CAGGC CAA 
ATGAAAAGAT 
CTGCATACCT 
TATGTAGCAA 
AAGACAGTTT 
ACACTCCTTA 
TAGATGTTTA 
AGTAGGTAAG 
GAAATG CATT 
CCAGCCTGGT 
CCCAGCTACT 
GTGAGCCAGT 
AAAAAAAAAA 
GAAACGATCA 
AC CTG AAAAG 
AGGGCTTAGC 
CAGCAGTGTA 
AATGTGCCCA 
GTGCATGTTA 
GTACAGAGGA 
GCGCTGGTGG 
TCAATCTGGT 
ACTGTATGAC 
AAATTGGGTG 
TCTGTGATAC 
ACAACACCAA 
TGTAAGTTCA 
TTTTTGCTTC 
AAACTCTCAC 
TAGTTGAGAT 
AGAAAAGACC 



GGCATGGTGG 
TGAGCCCAGT 
AAATGAGCCA 
GAGGATGACT 
CCAGTCTGGG 
AAAATGAAAG 
AGAACAATAA 
GCAATTGCAA 
CCCTTTGGAA 
CAACATAGTG 
AGTAGACAAT 
TTCCAACAGT 
AAAAAATATT 
TTAAATTTTT 
ATTTCAGTTA 
AAAGTGAAAG 
ACTTATGCCT 
GAGTTCGAGA 
AAAAAAAAAG 
AAAACTGTAA 
TTTGGGTGCA 
CCACACATTG 
ATATACAAAG 
TGACCTGAAC 
GCTTGACATC 
AGTAGAATGA 
TAGTAACTTG 
GGTGGTTTCT 
GTATTTATCC 
TAGCAGCTTT 
TGGATAACTG 
CACTTTGGGA 
CAACATGGGA 
CGGGAGGCTG 
GCCACTGCAC 
AAAAAAAAAA 
AGCCATGAAA 
ACTGCATACT 
CCGGGAAGAA 
CAACAGCAGA 
GAGTAGCAGC 
AGGGGGATTA 
AAGAAGTCGA 
GCGTGTCTTA 
CCGGAGTAAA 
ACTCTAGAAA 
ACAGGAAGTG 
TATAATGGTG 
GAATAAACCC 
ATTGTAATAA 
AATAGTTACA 
ATTCAATGTG 
CCTAGCAGAT 
AGAAAGCTAG 



CTCAAGCCTG 
AGTTCAAGAC 
TGTGTGGTGG 
TGAGCCTAGG 
CAACAGAGCA 
CATACTACTG 
AGATAAAAAG 
ATATCCTGTA 
GGCCAAGGAG 
AGACCCTGTA 
ATTGAAAGCC 
GCAAAAATGA 
GACTATCTAA 
ATTTAAAAAA 
TGAAGATTTT 
ATTAAGTTAC 
GTAATCCCAG 
CCAGCCTGGG 
AATACCAGAC 
TTTATTTAGC 
TTTCT.TTTCT 
GCAGGTAATG 
AATACTTAAA 
ATCTGTTCAC 
ATGTCATTAG 
CCAAAATTTA 
TTCATTACTG 
TACAAAAGTA 
AAAGGAATTG 
ATTCATAATT 
TGGTACTTCT 
GGCCGAAGTG 
AAACCCCAAT 
AGATATGAGA 
TTCAGCCTGG 
AAAAAAAGAA 
ACACATGAAG 
ATATGACTCC 
TTCAAGGGTG 
ACAGGTACTG 
TCAGGGGCAG 
TGCAGAAATT 
TAATGTCCTG 
TGGAGAGGTG 
GTCCCTGCCT 
AGACAGTAAC 
TTGAAAAGGC 
AATACATGAC 
TAATGTAAAT 
ATGTACTACT 
GTTGAAGTAA 
ATGGTCTTTG 
CGAGTCTTCA 
CTCTCTCTTT 



TAATCCCAAC 
CAGCCTGGGC 
TGCGTGCCTG 
CAGTCAAGGC 
AGACCCTGTG 
ATACAGAATT 
GAGAGTGGAA 
GCAGAACAAA 
GGAGGATTGT 
TCTAAAAAGG 
ATTTTCTGCA 
ATAGATATTA 
TAGTATCTAA 
AGAAAACCAT 
GTTAGAAAAC 
CAAACAGTTG 
TACTTTGGAA 
CACTGTAGCA 
CTTGCTAACA 
TTTAGAGTAC 
CAAAGGACTT 
TTTGCAAAAG 
ACTCAACAGT 
CAAAGAAGAT 
GGAACTGCAA 
GAACACTGTC 
GTGAGAATGC 
ACCATACTTT 
AAAACTTATC 
TATCCAAAAC 
GAATAATGGA 
GGTGGATTGC 
TAGCCGGGCA 
ATCGTTTGAA 
GCAACAGAGC 
AGAAAAGAAA 
GAAACTTAAA 
AACTGATGCA 
AAGTGGTGGT 
CTCCTTGCTG 
TTCTGCAGTA 
TCTAGAAAAA 
TTGTTGCCAT 
CTTTAACCTC 
CCGGAGTTCA 
TATGGACACA 
AGAACACAGG 
ATTATACATT 
TACAGACTTT 
GTGGTGCTGG 
ATGTTTGTGT 
GAGGTGGGCT 
TGATGGGCAT 
GCCATGTGAA 



ACTTTGGGAG 
AACATGGAGA 
TATTCCCAGC 
TGCAGTGAGC 
TCAATATACA 
GAGTAGAGAT 
GAAGGTATGT 
ACAACAAAAT 
TTGAGC CCAG 
AAGAAAGAAA 
AATACATAGT 
GTTGCCTGAA 
GCTAGTAAAT 
ATTTATAAGA 
TATGAGAAAA 
CTAAAGAATA 
GGCCAAGGCA 
AGACCCGTCT 
ATAGCAAAGA 
TCTCGTGATA 
GCAAATTTAC 
ACAGATCTGA 
AAGAAAATAA 
ACACAGATGC 
ATTAAAACAA 
AG C AC C AAAG 
AAAATGTGCA 
TACCATAAGA 
TCCACACAAA 
TTGGAAACAA 
ATGTTATTTA 
TTGAGG CCAG 
TAGTGGCGTG 
CCTGGGAGAT 
AAGACTCCTC 
AAAGAAAAAG 
TGTATGTTAC 
GGGCAAGCAA 
GTTAGCAACT 
AGCAGGGCTA 
ATATACCTGC 
GAGTGGTAAC 
GGCAACGAAA 
GTCCCTGTTT 
CTCCTGCTTC 
GTCAAAAGAT 
ATTTTTAGGG 
TGTCAAAACC 
CGTTGATAAT 
ATGTCTATGG 
TTCCCACAAT 
CTTTGGGTGA 
GATGGGACTG 
GACATAGCAG 



GCTAAGGTGG 
AACCCTGTCT 
CATTCTGGAG 
CCTGATCTTG 
TATGGACAAA 
GCAAAGCTAG 
CATGAATTTC 
TGTAGATAAA 
AAGTTGGAGA 
AAAAAAAAAA 
GAATTTGATC 
ATAAAAATCA 
TTGGCCAGTT 
AGAGGTGATA 
AAACTATTTT 
CCAGATGGCT 
GGAGGATCAT 
CTATTAAAAA 
TCAATTAATT 
TGAGATTGCC 
AAAGAAGTGT 
TGAAGAACAA 
CCTGATTTAA 
AAGTATGCAT 
GTAGATACCA 
GTTGCAAAGA 
ATCACTTTGG 
TTCACCAATC 
AACCTGCACA 
GATGTCTTTC 
GAGTTAAAAA 
GAGTTTGAGA 
AGCCTGTAAT 
GGAGGTTGCA 
TGTCTCAAAA 
AAAAAGAAAA 
TAAAAAGCCA 
GCCAAAAATT 
TTTACTGAAG 
ACCCATAAGT 
TTTTAGTTAA 
TTCGGAGTAG 
AACTGACATG 
CGGCTAGTCT 
CTGCTTCACA 
TAGTTGATAG 
CAGTGAAACT 
CATAGAAAGC 
GACGTGTCAA 
TGGGGGGACA 
GCATATGTAG 
TAGTTAGGTT 
GTCCCTTATA 
GAAGGTAGCC 
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ATCTGCAAGC 

ATAAATTGTC 

AGACTGTTAA 

AGTACCTTTG 

CTAGGACAGT 

TTCTCTGTTT 

AATCTTTCTG 

AAAGGTTTTA 

TTCACCCTAC 

GGAACTTTAG 

TGATTTTCTG 

CACATCTAGA 

GGCTATTTTA 

AAAAATCAGC 

CTCTGTAACT 

AATTCAGTGG 

GTCTCAGCCT 

TTCTTTTAGA 

TCCCATCTCA 

CTGTAAGTGT 

TGGGTCAGTT 

TTCAAATTAC 

TCTTTATTAA 

TGCACCCCAA 

TTTATTTTAT 

TATTCAGCAA 

TTAACAATGT 

CTGTGGGATG 

CCTG CACAAA 

GCATACCGAG 

AGAAAAATCT 

TAACAAAAGA 

GTACTGAGAT 

AAGAATACAT 

AATTGTGGGA 

TTCTCTGGTG 

CTTCCCCGAG 

AGGGCAGAGG 

ATACTTCTTA 

CCTCTATCAT 

TTTATATCTC 

TGTTGCTTCA 

AACATTAATT 

GGCTCAAGCA 

ATGCCCGGCT 

GTGTTGAACT 

CAGGCGTGAG 

AG CAATTTTG 

GTCATTACTC 

GCTCTTCTCT 

AGATTTTAAA 

CCAGAACTGA 

ATCTCATTGT 

GAACTCAGTA 



TAGGAAAGGG 
GTTGTTTAAG 
TTGGATTAGA 
TTAAGCTTTT 
TTAAATAGTA 
TTTAATATGA 
ACTTCTTAAG 
TGATAGGTTT 
TTTTGTGTCC 
TGGCTCTCTT 
GAGTTTCCAT 
CTTGCTTTAA 
TAGAACTTAA 
CATAAAATCA 

CATGACCATG 
CCTGAGTAGC 
GATGGGAGCC 
GCCTCCCAAA 
CTGAATTTCA 
AGGCCATTGG 
TCTCAAAATT 
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AAGCCAGTCT 
AGTCTGGTAG 



135/162 

CCTTCACAAA 

TCACTCAGGC 

AATTTCCTTT 

ATATCAATGA 

TGAGTGTGAA 

GAACTCCTGA 

TGTGGCCATG 

TAGGACACTA 

TTTTTTCCAT 

GGATCCCAGG 

ACACAAACAA 

ATGGGAGAGA 

TCTAATCTTC 

TCTTCTTTTT 

TTTTTTTTGA 

GCTCACTGCA 

TGAAACTAAG 

TTGCCCAGGC 

GTGACAGGAT 

TTTTGTATTT 

TTTCTTTTTA 

ATTCCAGTAT 

GATTTATCTA 

ATTTTATATT 

AAACTTTTAT 

TAGGGATATA 

TCCCCTTTCT 

GAGAAGAGCT 

CTCGGTTACC 

TATAGAACAA 

TTCATCTTAG 

AGACAGAAAC 

AAGAGGATTT 

ACAAGGAAGA 

TTCTTTCCCT 

GCTGATAAGA 

CCTTCACAAT 

TTTTTAATCT 

GGCATATTCT 

ACTTCTGACC 

AGATGGAGGC 

TCTCAGAGGT 

TTGAGGCACA 

CTTGGCCTCA 

TTTTTAGTAG 

AGTGATCCAC 

TGGCCATCAA 

TGGTCATCAT 

GGTAATTTGG 

AAGAACTGCC 

GTAATTG GTC 

AAAGTTCAAT 

TAACCCTTCT 

CCTCCAGGAC 



GAATCAACTC 

TGTGGTATTT 

TGGGGATGGT 

GTTTGTAGGC 

GGAGATTGTT 

TTTTCAGCCA 

TACTAAATTC 

GAGTAAAAGA 

CCTACAACTT 

GGTAATTGAG 

GACCTGGATT 

AATAAACTTG 

AAGGGTACAT 

TCTTTTGTTC 

GACAAGGTCT 

GCCTTGCCCT 

GCACATGCCA 

TAGTCTCAAA 

TACAGGTGTG 

ATCAGTCTGT 

AAGGTTTTCA 

ATATTCTTTT 

GCAGGACTTA 

TCTTCTGGCC 

TTAATTGTTT 

GAATTTCCTC 

GGGGAAGACC 

TATGAATATC 

CACTGGGTCC 

GTGCAACTGG 

GTTCTCCATC 

AAAGCATGTG 

AGAACTTGGG 

CGAAAAGGAC 

TTTTTTTTTT 

GTCTAAAGTC 

TTCATATCCT 

ATTTTTTTTC 

GCTACCCTTC 

CTGGGGAACA 

CCAGCCATGC 

GATGTTGTAG 

GCATGATCTT 

CGAGTAGCTG 

AGACAGGGTT 

CTGCCTCAGC 

TTTTTATGTC 

TAATGTTGTT 

TCTTTTTCAA 

AACAGCCATT 

CTAGCAGACT 

TTTACTCATA 

CTCACACTTT 

TGCCGCTTAG 



AGACCTCAGA 

TGTTTCAGCA 

GTGTGGCGGG 

1*^**1* T T T 

GGTCATCTAT 

ACTACCCTGG 

TGGCTAATGC 

GAGCTGTTGC 

GGGTTGTGAG 

GGGTGGCTGG 

TTCTGGGCTT 

TTTCAGCCAC 

GAATTGCTTT 

CCCACATTAT 

TGCTCTGTCA 

CCTAGGCTCA 

CCATGCCCAG 

CTCCTAGCCT 

AGCCACCATG 

TTAGATTTTC 

AATTTATTTG 

GTTCCTATTT 

TATTCTTTCC 

TTCCTTATAG 

ATTTTATTAT 

TAAGCAGCAT 

ACACTTACAT 

CAGATTACAT 

TATTACCCAA 

CGATAATCCT 

TGTGGCCCTA 

CATTGTACAA 

CTTATATAGC 

TTTGAGTTTC 

TTTTTTTTTA 

TCTGGTGACT 

GCTTTTAGGC 

TCAATTGTCT 

ACTTACTACT 

TGGGCAATAG 

CTCTGACATC 

GACTTCAACA 

GGCTTACTGC 

GGTTACAGGC 

TCACCATGTT 

CTCACATAGT 

AACTCTAAAT 

TATGTTTTAG 

AATGAAGTTA 

TCAGCAATAA 

GGAAAATACC 

ATTCCCTTTT 

GCTTGGCTGT 

ATTATTAAAC 



ACAGTGAGAG 
GCCCAACCTA 
GGGTGCGGGG 
TGGTCATTGA 
TCGATGTCCC 
AAAAAAAGCT 
AAGG CAAGCC 
ACACATGCTC 
TATGATGGCT 
AAGGAATCTG 
CCCAGACTTC 
TGTCATTTTG 
TCCTTAAAAA 
TTAGTTGGAG 
CTTAGGCTGG 
AGCAATCCTC 
CTAATTTCTT 
CAAGTGATCC 
CCTGGCTGCT 
TTTCCCTTCT 
CATC TAATTC 
TCTTCTGTAT 
AT AACTTT C C 
TTTCCACAGG 
CATTCTTTCT 
ATGCTAGGCT 
TAACACAGGA 
CTTCACTGAT 
GTCTGGGTCA 
TCTGTTGGGG 
TCAAGTAGAC 
AC AC AG GGG A 
ATTTTAAGAA 
TAGTGCAGTA 
AAAAAAAGAC 
AACTTTTGTT 
AAACAGGGAG 
TCAACTCAAA 
TACAACCCAG 
TTTGAACTCT 
TAGACACAAC 
AATATCAGTA 
AGCTGCTGCA 
CCCTACCACC 
GGCCAGGCTG 
TCTGGGATTA 
TATAACATTT 
TTGTAGTCCT 
AGGTCTATTT 
CTATTTACTG 
AAATTCTTTT 
CATTTGAAGC 
TTCTCAGGTA 
AACATGTCAG 
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TGGTTGGAAG 
GCGGATAATT 
GGAGAACGCA 
ATCAAAGCCC 
CTTGCTTTGC 
TTCAACATAA 
ATGACAGCGA 
TGCTGGTTAA 
AAGGAGAATA 
GGAAAGTAAA 
AGTCCAGAAA 
AAGTAAGTGC 
GCTCTTGCTT 
CGGTTCCTTC 
CCAAGACACA 
ATCCACGAGT 
GATCCAGAGT 
TGAAGAACAC 
GCTGGAGTGC 
TTCTCCTGCC 
AATTTTTTTA 
CTCCTGACCT 
GAGCCACCAT 
TATTGTTTCT 
TGTTAACTAT 
AAGCTTATGT 
GGTAGTTGTT 
GTCAAAGGAA 
TAGATTGGAG 
ACAAAGAAGC 
TAGAAAGGTA 
TAGAATATAC 
CCAACAAGTC 
CCATCCAAGC 
CTTCCACCAT 
ACCCCGGACA 
TCTTCTAACA 
AGGATGCCAA 
TCCCATGCAG 
TCTACTGAAT 
AGCTTGTTAT 
TAAGCTCTTC 
ACTGAGGACC 
CTGATAACCT 
GGATGTTCTA 
GGTCTGTGAC 
GTTTGTTCTT 
ACATGGACAG 
AGAGGAGACC 
CTATGATTTG 
CCCTACACCT 
GGGTGCTTCC 
TCAAACTCTA 
ATATTACCTG 



AGTCAATGTT 
GCAGCTTTCT 
GACCTTCTGA 
TGACCCATTA 
CCCAGTCCAG 
ATCAATTATA 
AGAAAACAGA 
ACCAAGGAGC 
TTGATAAACC 
GATGTGGCAG 
AATTCCCACA 
TACATAGCCT 
CATGCCAGTG 
GCCAACATCC 
GGGGAAGGCA 
CACTTGTCTC 
CACACTAACT 
AGGTTATTTT 
ACTGGCACTA 
TCAGCCTCCG 
AATTTTCTGT 
GAAGTGTTCC 
GCCCAGCCAC 
TATAAACTGG 
AAAGTGGAAA 
TTGCTTAATG 
GGGGTGATCA 
TGAGAAAAGA 
GCTGCAAAGG 
AGGTGGTGAG 
GTGGTGCATT 
TCTGCTGCTT 
TGTGCACTTT 
CACAAGGGGT 
TTGG C AC AG A 
TCTTCCAAGA 
ACATGTAGTA 
GGTACAGAAC 
GACTTCCAGG 
AACCACCAAC 
TGGAAGACCC 
TCTTTCCCCT 
AATATACATG 
AAAGTGAGAT 
ACAAGAGAGT 
AATTAAGGAA 
CCAAATAATC 
ATTAACCAGG 
ATAGGGCACA 
GAGGGTCTGC 
GCCACTTGAG 
TGTTCCTTGT 
TTCAGTTAGC 
AATG AC C C AG 



ATTTTGATTT 
TTCATTCCCT 
CTTGTGGGTA 
AAGACGGATG 
GTTAACCATC 
TATCCACTAA 
CCAAACGTCT 
TTCTGCTCTT 
TGGAAATAGG 
CCAGTATTCC 
AGCAGGGGCT 
TTCTTTTTGC 
CCCCTCTGCA 
ACTTCAAAGT 
GTAAATCTCC 
CGATCCTCAG 
GCAAAACAAA 
ATTTTATTTT 
TCTCAACTCA 
GAGTAACTGC 
AGAGATGGGG 
ACCCACCTCG 
AAGTTATTTT 
GTGAGCTTAG 
TTACCGTATT 
CTTGGTAAAA 
GGCCCAACAC 
CAAGTTAAGA 
CCCTAAGCTC 
GACGTGAGGG 
AAGCGTAGCT 
GAGATAGTAG 
CCAGAGGCTA 
TTTATGCCCT 
GCTTGGTGTT 
CTCTTTTACA 
ATAATGATAT 
TAACCTGTTA 
AATCATGAGA 
ATTGAGAATC 
AGGTCTCATG 
CAGATAATGT 
AAAAATATCA 
ACTGAACAGT 
TAAGAAACCA 
TTCAAGTATT 
CCTCACTTTT 
CTTTCCGCCC 
AAGAAGCAGC 
ATTCCTCTGC 
GCCCAGAGGA 
GGATTTCCCA 
TTTTATCAGC 
GAAAACCCAT 



TTCTGTTTTG 
ACATGAGTTC 
CCCCTACTCA 
GAGACAGCAA 
TGTGGTATTT 
AATCTCAGCA 
GCCCTTATGG 
TTCCTTAGTC 
GCCGGAGAGT 
CGTTATAAAA 
GCTCATG CAG 
ACAGCCTGAG 
CATTTTCCAT 
AACGTCTTCC 
TGGAAGATGT 
AGAGAATTAG 
ACAAAACAAA 
ATTTTGAGAT 
CTGCAACCTC 
GACTACAGGT 
TTTCGCCATG 
GCCTCCCAAA 
CAATAAAACC 
GCAAATCATT 
TGTTGCAGAG 
TTCCTGGTAC 
CAGGCCGTGG 
GTG CATAAAG 
TGGGAGCCCA 
TAAACAGGTG 
GTGACAGTTT 
AGGACACGTT 
TGAGGGGTTT 
AGGCTTAGAT 
CCAAAGGCCA 
TTATGACAGA 
CATCAACATC 
ATATGGTTAC 
CAGTTGAGCA 
AGAGAGGGAA 
ACACATGCCT 
TCCATAAGCA 
GACTAGAATC 
ATGCAGTTTT 
CTGTGCTACT 
CAGAAACACT 
CCCTGTCTTG 
TCTGGCTTGG 
CTTGGGAGTA 
CTCTGGGCAC 
GGTGATGGCT 
ACTCCTCACC 
CTGACTCACA 
GTTGAGAAAT 



TTTTGTTTTA 
AAATGGCAGC 
TCACCTGAAG 
CATACGATCA 
TTAGTTG CTA 
CTAGTCTAAC 
GATTTATATT 
ACCTGGGGGA 
ATCAGAGAAG 
GGATACAACT 
ATGAAGGGAA 
GGTCCAGAAT 
ACAAACTCCT 
TGAGGTGAAG 
GTCCTGATTC 
TTCGTGATGA 
CAAAAATAAT 
GGAGTGTTGC 
CACCTCCTGG 
GCGCACCACC 
TTGGCCAGGC 
GTGCTGGATT 
AGCCTGTGTT 
TAACTTTCTG 
AATGGTGGGT 
ATGGTAACCA 
GGGCTACAAA 
TGGGTCCAGG 
CACTATTTAT 
AGGGCATGAG 
AGCATTTTCT 
TATGAGTGAA 
TATGCCCTGA 
TTGTGGTGCG 
CGAGGGGTTT 
CAAGCCAGTC 
ATCTTCGTCT 
CATCCTGTCC 
GAAAGATACC 
AATGACTCAG 
AGTCCCATGA 
TTAGTATGAG 
AAACAAGACA 
AAAAATAAAA 
GAGTTAAATG 
TCCTGTGCTG 
CTCTGTGCCC 
TTCAGCCAAT 
TTCAGTACCC 
ACTCTAGTAT 
CTCTAACTGT 
TTTGTAAATA 
GAAGTTTGGG 
TAAAATGTTT 



AATGCAGTTG 
AAACAAACTA 
ACCCTTGGAA 
TCACTATTAT 
AGTC CAT ATA 
TACTAAGGAA 
ATTTTCTCTG 
GGCAGAAACA 
GAAGCCTTCG 
CCGGCCTCAT 
GTTGGGGGAG 
CCAGACTGAG 
AAATCCCATC 
CCTTCACAAC 
TCCTGGGTGT 
GCTGTATCTG 
TTTGTTGCTG 
TGTCACCCAG 
ATTCAGGCAA 
ACAAGTGGCT 
TGGTCTCAAA 
ACACAGGTGT 
CAAACCCAAC 
AGCCTCAGTT 
AGGATTGAAT 
CCTAATAAGT 
GTCCGGCGGG 
GTGCCAGCAC 
TGGTGATCAA 
GACATGGGGG 
TTGACACATG 
AAGCAAGGAA 
GCCCTGGGTT 
GCAGGGCAGC 
TGGACCCTGG 
CTG CTTCAGC 
TAATTATTCA 
AAAGTTCTTC 
TTTTCCCTTC 
CTAATGTCTT 
CTTTTAATTG 
ATAATAATAC 
GAAAAAAGAT 
AATGGTAATA 
TTGATCAGTT 
GATGCTCTCT 
AGGAAGGCTG 
GGGAAGCACC 
CAGTCCCACG 
AGTTACAGCT 
TCCTAGTTCT 
CCCTCCTTTT 
GTTTCAATTC 
ACGGGGTGGT 
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AATACCACTT 
TGACACATCA 
ATACTGTTAT 
GGGCCAGGTG 
ATCACTTGAA 
CAAAAATACA 
GGCTGAAGCA 
GCCACTACAC 
AAGAATGAAA 
ATTAGTGAAT 
GAGAAATGGA 
ATAGATCAGG 
AGAAGCGCAA 
AGAATATGCA 
GAAAACAGTC 
AATATAAGAT 
TGTAAACTAC 
TATAAAAATA 
CTTATCTTAG 
AAAAAGAACA 
ATAACAGTGA 
AGTAGCTTCT 
GTAAGACAGA 
TAATAAATCT 
AAAATATTTC 
AAAGAAATGA 
AAGCAAAAAT 
CAATCTCATA 
TATAGAACTC 
CATATTATAT 
TGTTCTACAT 
TCTTTAATAT 
GTGTACAATA 
CTGAGGCGGG 
ATTCCATCTC 
CCAGCTACTC 
TGAGTCGAGA 
AAAAAATTAA 
GGCTGAGGTG 
ACTCGGCTTG 
ATCAATATAT 
ATCTTATTAG 
TATCCTCTAT 
TCTCAAAAAA 
GGGGAGGCTG 
TACTGCACCA 
ATATATAAAT 
ATTATGAAGA 
GAAGACACTG 
CATGCAATGT 
GTCTCCAATG 
GTGTGTATAT 
TCAGAGCAAA 
CTCCCCCACC 



AAGAGAAAAA 
ACAAAAACAT 
AGTATTATAT 
TGGTGGCTCA 
GCCAGGAGTT 
ACAATTAGCT 
CAAGAATCAC 
TACAGCCTGG 
GGAGTCACCT 
TCATGTTTAG 
CAAATCTACA 
CAGACCAAAA 
TCTCATAGTC 
GTTTTCTCAG 
TGAACAAGTT 
TATAAACCAA 
TAATAAGTCA 
TACAATGAAG 
AAAAATTAAA 
TAGAAAGCCA 
AGTTAAAGAA 
TTTAAAAGAA 
AGGCACAAAT 
TATGCTACTA 
TGAGTAAAAC 
AATCACTACT 
GGAACTTGTC 
CCAAATCCTA 
CTGCACAAAA 
GTGTATGTAT 
ATTATATATG 
ATTCATCTAG 
ATCAGGCCAG 
TCAATCACTT 
TACAAAAAAT 
GGGAAGCTGA 
TTGCGCCAGT 
AAAATTAGCC 
AGAAGATCAC 
GGCAACAGAG 
ATATTATATG 
ATATATAGTA 
CCATATTAGA 
ATTTAAACAT 
AAATGAGAGG 
CTCAGCCTGG 
AAATATGTAT 
CACTGGGAGA 
GTGGGATGGT 
TGTTGTTGAC 
AGACATACTT 
AGCCTTCTCT 
GATGACAGCT 
TCTTTCTCTT 



ATATCAATTG 
ATAGAAAGAT 
CAAAAGATAT 
TGCCTGTAAT 
CAAGACCAGC 
GGGCATTGTG 
TTGAACCGGG 
GTGACAGAGA 
AAAAAAGATA 
AATTGTGTTA 
ATCATCATGG 
GAAAGAAATA 
AATACATAAA 
GTATAGATGG 
TCAATAAATC 
TAATAAAAAG 
TTAGAAGATG 
CTAAAGCAGA 
AAGCTGAACA 
AATATAATTT 
AACAAGAATG 
ATTTAATAAA 
GGAATGAATA 
ATAAAATTGA 
TCACTCAAAA 
TTAAATTTTA 
AGTTTTATAG 
CAAAGAATAG 
TAATATAAAT 
TATATATGTT 
TATAGTGTAT 
ATTGTCATAT 
GCACAGTGAC 
GAGTCCAAGA 
ATGAAAAATT 
GGTGAGAGGA 
GCACTCCAGC 
AGGTATGGTG 
TTTAGCTCAG 
TGAGAGCCTG 
TACCAATCAA 
TTCCTTCTCC 
GAGAGGATAC 
CAGC CAGATG 
ATTGCCATTG 
GAGACAGAGG 
AGAGAGAATA 
GAATACTATA 
TTCATTACCA 
TGAGCTGGCA 
AACAAAATAT 
CACCAACCTA 
AAATTTTTTT 
CCTCCTCCTC 



GATTTTTAAA 
TGGAAGCTAA 
TAAGTCAGAG 
CCCAGCACTT 
CTGCCCAACA 
GCACATGCCT 
GAGGCAGAGG 
GAGATTCTGT 
ACACAATTTT 
ATATACAAAG 
GATGTTTTAA 
AGGGAAGATA 
GCTCAGCAAT 
AACATGCACT 
TGTATTACAC 
ATGACTAAAA 
TATAGAATGG 
ATTTTAAGGA 
TTAATGAGCC 
TTTAAAAAGA 
CAATAAAGAG 
ATAGACATAC 
CAGAAACTTT 
AAGTACTGAT 
AACAAATAAA 
CCGACAGATA 
GAAATTTAGA 
AAAATTATGG 
AACAAACCAA 
AACATATACA 
GTATTTTACA 
ATGACATATA 
TCATGCCTGT 
GTTTG AG AC C 
ATCCAGGCAT 
TCACTTAAGC 
CTGGGTGGCA 
GCCTGTTCCT 
GTGGTGGAGC 
TCTCGAAAAA 
TGCTTCACTT 
ATAGATAGAT 
TATATATATC 
TGGTGGCCCA 
ATCCTCTCAT 
GAGACCTGAG 
TAATATATGT 
TATATATGTG 
ATTGG AC CAA 
GAGCACTGGT 
ATGAACTTGC 
GCAATTGTCT 
GTCCCTTTCT 
CTTCATCTCT 



ATTCCACCTA 
AAGATAGATA 
CATTATTAAG 
TGGGAGGCCA 
TGGCAAAACC 
GTAATCCCAG 
TTGCAGTGAG 
CTCAAAAAAA 
AAACATAAAT 
CAAAAATTGT 
CATTCTTCTT 
CGGAAGGTCT 
TGTTTAATAA 
AACTGAGTAA 
AGATCATTTT 
AGATTCTAAA 
AACAATAATA 
AAATTTGTAG 
AAGCATCTAA 
AAAAATAGAT 
GAAAAACAAA 
CTCCAATGAG 
TTAAATATTA 
AAAATTATTA 
GCATGGGCAG 
ATAAAACGTG 
AGTCAAGGCA 
CTCCCGCTTA 
ATTTTATATT 
TATATAATAT 
ATATATAAAT 
TAATACATTA 
AATCCCAGCA 
AGCCTGGTCA 
TGTGGTGCAC 
CTGGGAGGTG 
AAGGGAGACC 
GTAGTCCCAG 
CATGATCGCA 
ACAAATATAT 
TTATATATAA 
AGATACAGAT 
TATAGCATAT 
TGCCTGTAGT 
TGGTTGAGCC 
GTGGAAGGAT 
GTGTATGTGT 
TGTGTGTATA 
GAGTCCAGGT 
CATAGTTACG 
CATATACGTG 
TCATCATCAT 
TCTTCTTTCT 
CTTCTTTTTT 



TCTATTGGTG 
ATATAGTCAT 
AATGGAAGAA 
AGGCAGGCGG 
CTGGCTCTAC 
CTACTTGGGA 
CTGAGATTTC 
AAAAAAAAGA 
GTAC TACATT 
AGAATTATAG 
TCCATAATTG 
GAACAATCTA 
TAGTAAG CAG 
ATACTAGGCA 
CTCTAGCCTC 
TATTAGGAAA 
AAATGTTATT 
GCTTTAAATG 
TTTAAATTTT 
ATTAAACAAT 
CAAAAAAAAA 
ATTTATCAAA 
CAGAACTTTA 
CTTCCTAGAA 
ACCTAACATT 
CATCTTTATC 
TGAGTAATGC 
TAGACATAGA 
TGCAACTATA 
GTATAG CATA 
GAAAACCCAA 
CAT C AAAAAT 
CGTTGGGAGG 
ATATGGCCAA 
ACCAATAGTC 
G AGATTG CAG 
CTGTCTCAAA 
CAACTGGGGA 
CCACTGTACC 
ACACACAGTA 
TATAGATTAC 
ATAGACATAG 
AG AGATG CTG 
CCCAGCTACT 
ATAATCG CAC 
ATAGATATAG 
ATATATATAT 
TATATATTAT 
ATGGAGCCAA 
GGAAAAGAAG 
GAGAGTTCTG 
TATAATGCTA 
CTTCCTTCCC 
TTTTTGAGAT 
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CTGGGATTAC 
TCTGTTGCCC 
GGCTCAGGAG 
GGGGCTAATA 
GACATAAAAG 
TCAAAAAACA 
AAGTTCCCAT 
ATGGGTAAAT 
TAGTGAAATC 
ACTGTCAAAA 
CTTATATTGC 
TATGATATGC 
TACGGGACAC 
AACAACAAAA 
AATAAACTTC 
AGGCGAGAGG 
GTGTTTCTAC 
C AG CTACTGG 
GAGCCATGAT 
AGATAAGTAA 
TCTAGACTCT 
TAACCTCTCT 
TGTGATATAA 
GTGCTCAGTA 
AGTCAAGTAA 
GGTATGTTCC 
TTGTAGCAAC 
ATGGTTACAT 
ATAGAAGAGA 
CCTACCTAGA 
CCTTTATATT 
ATAAAATAAA 
AGCTGGGCAC 
TCACCTGAGG 
AAAAATACAA 
GGCTGAGGCG 
AGCCAGCCTG 
TAATAAACAG 
TAGTTTCACT 
CAGCCTGGGC 
AAGAAATAGT 
ATTGACTCAG 
TGTATAGGAT 
AGGGGAACAC 
TTCAAATTAG 
ATCATGGCCT 
TATTCAAATT 
GGTTGGCCAC 
TTCACACCCT 
ACCAATAGGA 



TCTGTCGCTC 
GGGTTCAAGC 
CACCTGGCTA 
TCTCAAACTC 
AGGCGTAAGC 
AGGCTGGGTA 
ATCCTCCTGC 
AAATTAATTA 
ACACAGACCT 
GGATGAAAAT 
AAATCAATAA 
CACAGAGGCC 
AGAAAAGCAC 
TCATAAATTA 
TTGTGGGGTA 
ATCCCTCACA 
AAGGAAGCAT 
AAACCCAACT 
AGGCCGGAGA 
ACTGCTTGAG 
AAAAAATTTT 
GGAAGCTGAC 
GGGGCCATTG 
ATAACAACTT 
AGACTCTTTC 
GTGTCATATT 
AACAAGAACC 
TATGTGAGTC 
TATGGTACCA 
CAAAAGAAAT 
ATTGTAATAA 
ATATTTATTA 
CAGTGTATAT 
GAAGTCAGAT 
GTTTGACTGA 
CATACATTTA 
AGTGACTAAC 
TCAGGGGTTT 
AAATCAGCCA 
CTGGAACCCA 
GGTAACAGCG 
TGTTTAAGAG 
TGACCCGGAA 
GACAGAGCGA 
TTCACTTGAA 
TGAAATCCC A 
GAAATACAGA 
AGGTCTATAA 
TTTAGAAGTT 
AAGGC AG AG C 
AAGTGGGATA 
TGTGGAAGAC 
CGTAAAATTC 
TATGGCAAAG 



AAGCTGGAGT 

AATTCTGCCT 

ATTTTTGTAT 

CTGCCCTCAA 

CACTGTACCC 

CAGTGGCGTG 

CCTAGTCTCC 

GGTGATAAAA 

TGAAGGAAAA 

GCAAAATGCC 

GGAAAAGAAC 

TGAAGGGCTA 

AAATTAAGTA 

TATAAGTAAA 

GAATTGGAAC 

CCAGCATGTC 

GGATAAGAAT 

ACACACAACT 

TGGTGGTTCA 

CCCAGGAGTT 

TAAAAAATTA 

GTGGGAGGAT 

CACTCCAGCC 

TGCATTTTCT 

TATGACTACC 

TCCTCCTCTA 

AAGAAAAGTA 

ATTATTCCTG 

TATATTAAGA 

GAAAGCACCA 

CTAAGTTCTA 

TATTCTTATG 

ATGTTACGTT 

TGGAGAGGGG 

TTAAAATGTA 

AAAATAAAAA 

ACTTGTAATC 

GAGACCAGCC 

GGCATAGTGG 

GGAGGCAGAG 

AGACTCCATC 

GGGAGAAATA 

GGCGGAGCTT 

GACTCTGTCT 

CCATATTATG 

GCAAAACTTA 

GTGCTTTTGG 

TTCCTTTTCT 

ATAAAGGCAA 

CCTGTAATCA 

AATAAAGACT 

AGACTCAAGG 
CTTGTCTTTG 

ATGATGGGAT 



GCAGTGGCAC 
AAGCCTCCAG 
TTTTAGTAGA 
GTGATCCTCC 
GGCCTCCTCC 
ATCATAG CTT 
CCAGTAGCTG 
TTCACTGCCC 
TGTGTCTACT 
ATCCACATGC 
CCAATAAAAA 
ATGGACATAC 
CACAATTAGG 
GACTCAGGGA 
AATTTCAAGA 
ACTCCAAGGT 
GTTCACAGTA 
TCAATGCCCA 
TGCCTGTAAT 
CAAGACCAGC 
GCCAGACGTG 
TGCTTAAGCC 
TGGGTGACAG 
GCCACATTGC 
TTCTAGTTAT 
TAAAGCAAAA 
AAGCTTTTCT 
GTGCTGGTAG 
TTAACAACAA 
GGATATAAGG 
AAAACAGCCT 
GAATATTAGA 
TGTACAAACT 
TGGGAAAAAC 
TTTGTTG CAT 
TAAAATTTAT 
CTAGCACTTT 
TGGCCAACAT 
TGCGTACCTG 
GCTGCAGTGA 
TCAAAAAAAA 
TTTAGTTAAA 
GCAGTGAGCC 
CAAAAAAAAA 
ATTCCTTCTG 
CACAAAGTCT 
GTTTTGTTGT 
GAAATCCCTG 
AAAAATG CAT 
AATTCATCAA 
TTTAAATAGT 
GTGGCCTTCT 
AGTGTGAGCA 
ATAATTTCTA 



AATCTCAGCT 
AGTAG CTAGG 
GATAGGGTTT 
TGCCTCGGCC 
TTTAATAGAC 
ACTGCAGCCT 
GAACTACAGG 
ACTGATGACT 
TAATTTTGAA 
CAGAAGATAT 
TTATTAAACC 
AAAAAGAATC 
TACCATTTTA 
GTTTTGGAGG 
TCTGTAGTAT 
ATCTCCCTGG 
GTATTGTCTG 
GTCCACAAGG 
CCCAACACTT 
CTGAACAAAA 
GCAGTGCTTG 
CAGGAATTTA 
AGTGAGACCC 
AAAATGGTGA 
GAGATCCTAC 
ATGCCCCATA 
AATCTGTCAC 
GAGTGTATGT 
CCTCGGCAAT 
ATGCATGGAC 
GAAGCTCCAT 
CATAAAAAGT 
TAGGGAAAGA 
CTTGAACTTT 
CTGCTTGAAG 
TCCTATCACT 
GGGAGGCAGA 
TGTGAAACCC 
TAATCCCACG 
GCTGAGATTG 
ATTTGAAAAA 
AGATAAGCCC 
GAGATCG CAC 
AAAAAAGAAA 
TAAAAGATGA 
TGTTCTTCCT 
TGTTTGTTGT 
GAACAAAATG 
ATACTCTAAA 
TATATCTGCA 
CTCATCTCAG 
ATGATTCCTG 
GGGCTTATGA 
TGATTACGTT 



CACTGCAACC 

ACTGCAAGTG 

CACAATGCTG 

TCCCAATGTG 

AGGGTCTAGC 

CGAACTCCTG 

CATAGCACAC 

AAGCTCTTTG 

ACCCTATTTA 

CAG CTATAAT 

ACAGTAAATC 

TCAATCTCAC 

AATCTGTAAG 

AGTGAGAGCT 

CTGGTAAAAT 

AGGGAACATT 

CAACAGCAAC 

CAATGGATTA 

TAGAAGGCCG 

TAAAGAGATA 

CCTGTGGTCC 

AGGCTGCAGG 

TGTCTAAAAG 

GAGAGTGGTT 

AACACTCACC 

TAGAGAGGAC 

AGACTAAAGA 

TACAACTTTG 

CCCAGTTTGG 

TAGAAAGTTA 

CAGTAGGGAT 

AACGAGTAAC 

TATAGATCAC 

CTCCTTATAT 

GCAATGTAAA 

TTTGTAATAA 

GACAGGCAGA 

CATCTCTACT 

CTACCCGGGA 

CGGCACTGCA 

AGAAAAATTT 

ATTTAAGAAA 

CACTGCACTC 

GAAAGAAAGA 

GAGTAGG CAA 

TCCTGTCATC 

TGTGTATTTG 

GGCTTTGCCA 

GTTCAACCCC 

GCAAAACATT 

TGCCGTTCAG 

CCTCTTGGTG 

ATTGCTTCTG 

TCATTATGTA 
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AGACTCCATC 
AAATAACTGG 
GTGGGCAACC 
AAAGAAATTC 
CCAGATAAGA 
AGGCCCAACT 
TTGTATTAAG 
CCAAATCATT 
ATTTTCAGGG 
CACTGATTTA 
CTGTAAATTG 
GAATGATAGG 
ATTGATAATA 
AGTCACAGGG 
AAGTAGGAGC 
ATGGCTAATA 
GCTCATCACA 
GAAATTAACT 
TTACTGCATA 
ATGACTCCTT 
AAGTTCCAAG 
AGAGGCGAGA 
CCTCCTGCAT 
AAAAAGAAAA 
TCTTCTTTGC 
ATGGATACTC 
ATAATTCCTA 
TATTCTCATC 
ACTAGCTTTT 
AAAGGTCCTT 
GTGGTAGCAG 
AGTCACCAAT 
GATAAAGCTG 
GAAAAGGTTA 
TATTTCAGGA 
GGTAAATGTT 
CTAATTGACA 
ATCTAAAAGA 
TATAACATAC 
TGAGCCCTGA 
CCCCAGACCA 
CCTCCAGAGA 
CCGGATTTTT 
GTG GAAGTAA 
ACACGCTACT 
GCAAATATTG 
GAAAGTAAAA 
ATGGGTAGTG 
GGAATGGTGA 
TATACTTAAA 
ACACACAATC 
CCCAGTTTCC 
CCTCAGATTC 
AAGCTGGGTT 



TTGCTGGCAG 
CCATGTTAGA 
TCCAGCCAAC 
TGCCTGCTAT 
ACACAGCCTG 
AAGCTGTGCC 
GTTCTAAATT 
TCAGGTTAGG 
GTTTTTTTGA 
AAAAATCAAG 
GTTGTAACAT 
TTATCATACT 
AACACTATGA 
CTAAGCACTT 
TGTTAATATT 
AGTGAAAGAA 
TTCACTGAGC 
AAACAACCTC 
AGGCCCCTTC 
AACATTTTCT 
GAGACAGCCT 
GATTTCTTCA 
TCAAACAGGA 
GGTGAGCATG 
TTTACCCGAA 
AAACAAAGTA 
CCAGCTTCTG 
CTTGCTGAAG 
GGTGATCGGA 
AGTGGTGAAT 
GTATCACACA 
CCTGTTAGTT 
TAGATTGTAC 
TCACTCATTT 
GAGAGTCAAC 
ATCCCGTGGT 
TAAAGGTACC 
AACATTACTT 
CTTTTTTCCC 
CATTTTTCCT 
GCCTCATAGT 
TAAGACTTCT 
TGTCTCTGAA 
ACCAAATGTC 
ACATGACAAG 
TAAGACTTTG 
TAGTGGTTAC 
AGTTTCAGTT 
TGGTTGCACA 
AGTGGTTAAA 
AGCCACTGGG 
TGCTG CAGAG 
CTTATAGTCA 
AGGAGTGCTA 



ATTTTCTCTA 

AGCCCATGTG 

AGCCAGAAAT 

CTCAGTGAGC 

ACCAACACCT 

CAGATTCCTG 

ATGGTAATTT 

CCAGATTTTT 

TTTTGTACTT 

GGAGCAGGAA 

TTGTCATCTG 

CACCTACTTT 

TCAGATCTAA 

TGTACATGTT 

TTCATTTTTC 

TAAGAATTTG 

AGTGAATCAT 

TCTGGCTGTG 

TATCCACAGT 

TCACATCGAA 

CTGATGTTTC 

GAGGTGCATT 

CTTACCTGCT 

GGAACATCGG 

GAAGTAAAGA 

ATTCCCACCA 

GGAGATTCAG 

TCTGAGGGCC 

TTTCCTTCAC 

CTGTGCACAG 

GTGGTAAAGC 

AGAACCAGAA 

CATAACAGCT 

TGGCTGCCCC 

AACCAGGGTT 

TTCATTTGGC 

CTATAAATTA 

TATCTTCCCA 

TACTCCAAGT 

CCAGTTCCAG 

CCCTCAGTCT 

CTTCCTCATG 

TCTGTACCTT 

CATCTATGGA 

CCTTGAAGAC 

CTTATACAAG 

AAGGTGTTGG 

TAGAAGATGA 

ACAATGTAAC 

TGCTTAAGTG 

ACATTATTTT 

TCATGTGTGG 

CCCAATTAAT 

GATAATTTAA 



AAGAGTCTGT 

CAAAGAGCTG 

AACCAGGGCC 

TTGGAAGTGG 

TAACTGCAGC 

AACCACAAAA 

GTTTGTACTA 

GTAGCCAAAT 

ACGGATACAA 

ACTCAGTAAA 

TGTTATCTAA 

TCTTGCATTG 

TACCATGATG 

GATGCATTTA 

AGAGGGGGAA 

AAAGGTTTGC 

ACTAACTAGA 

AGCCTGCCAA 

ATTCAGGAAT 

GTAAAGCTTG 

CAGCTTCACA 

CCATTCATTT 

CAAAGTGTCA 

TATTTCATGG 

GAGTTACCCT 

GTCTTAGGTA 

CATGGCAGGA 

AGG AG CTTTG 

AGTGAGCCCA 

CCCTGAGACT 

AATCATG CTA 

TTAATGGCTC 

CTGGAGCAAG 

ACTTCATCAC 

CTCTGCACAT 

CAAGCTGTGT 

GTGAAGGCCA 

TGCTTCCTTA 

ACACAGCCTC 

GATGTGGCTC 

ACTCAGAGTC 

TAGGAAACAC 

CTCCTGGAGT 

TGAATGGATA 

ATTCAAGCAA 

GCATCTGGAG 

CAAGACCAGA 

AAGATGAAAC 

AATGTAAAAG 

TTATATATAT 

CTCATGAGTC 

GAGGCAGGCA 

TTTCTTGTTC 

TTGTGAAACT 



CTCCTGAGCT 
AGGGGTGGCC 
AAAGTCCTGC 
ATTCTTCCTT 
CTTATCAGAC 
ATTGAGATAA 
ATAGATAACT 
GAATCATGAT 
ATTTGTGAAA 
TGGTTCTAAC 
GTCAAGTTCC 
CTCTAAGAGT 
TGCTATTATG 
ATTTTGATGA 
ACCAAGTCAC 
AC AG ATAAC C 
GAAAGTATGA 
GGGACAGGTG 
TCTTTAGTGA 
GAAACATTGC 
GCCCAACTCC 
CTATATACGC 
TTCACATTCT 
GGCTTGTCAT 
AGTCTTAGTC 
TTGATGGATA 
TGTTTATCAA 
TCCATGCTCC 
GATTAGAGAA 
GGGCCACTGC 
TACACTCAGC 
CAGATGTTTA 
GGTTCTACAA 
CCATCAGTCA 
GGGCCAAGGA 
TCCCTCAGAA 
GCCTGATGGC 
CCATTCTCCT 
ACCTGCAGCA 
TTGAGTTCAT 
TGTTGTTCTT 
TGGAGATTCT 
CAAGAAAGTA 
AACAAGAATG 
AATAAG CCAG 
TAGTTAAGTT 
AAATGGACAG 
TGAGTTGCAG 
CACTTAATTC 
TTTCACACAA 
ACTGAAGCTG 
CTCAGATGTG 
TTCAGCCAAG 
AGGGCCAAGT 



CTCTCTGAAG 
TGTAGAAGCT 
AACCATCAGG 
AGCCTAGCCT 
CCTAAGCAGC 
CATATCAGTG 
AATATAACCA 
AAAACTTTCC 
GTATAGTCAG 
ATTTTGGAAT 
TAAAATATGT 
TGGCTGAGCT 
ATCATGTGTC 
TAACTCAATG 
TTGGAGTAAC 
AGAATGCAAT 
AAGCTCTACT 
GTAAACTTGG 
AC ATAC CTTG 
ACATAGTATG 
TAGAATAAGC 
ACACCCCTCC 
ATAAAGAAAC 
GCAGGGCTAT 
TTAGATATTG 
CCCAGATGGA 
CATTTGCATC 
CTCTGTAAGG 
CACTTATCAT 
CACTAAGATG 
CTTACAGTAT 
TCTTCCTACA 
GCAAATCAGG 
CCTAGTGGAG 
GGCAAACAGT 
GTTTATTTTT 
ACTGATGTAC 
TTAATAGCAC 
ATTTCTGGGC 
TGCTCTTCAG 
CTTTCTCCAG 
TAAAGT CAGA 
TGGTCAAAAG 
AAAGTCTGAC 
AAACAAAAGG 
CATAGAGACA 
TTATTGTTTA 
TTTGGAGATG 
TACTGAACTA 
ACACACACAC 
GAAGAATGTC 
GAAGAGGTTG 
ACACAGGAGA 
TCAAACACTT 
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165241 TATCAGTTAC AAGGATAAAA AGAGGTTTTT ACTTATGATT TAAGAAGTTA GATTTCTGAG 

16S301 TTGGAGCGAT TTTCTTGAAG TAAAAG CTTA TAATGAACAT CACCCAGACT GGATTTTAAG 

165361 ACAACCAGGC TGGTAAGAGG GTC CAT AATT CTTGGCAGGG GGAGCTTTGA GTGTGACAGG 

165421 CATTTATTAT GGTTAACTGA GAAATACTGT TCTACTACCC TAGGGTCATC TTAAGCATTC 

165481 CTATGTGTAA GACTGACAGA AATCAAGTGA AACTCTCATC TGAGGAGATG TAAAGTTGCA 

165541 ATTTCCATTA GTGCTGTCTA AATTAATGCA GTGGGAGTGT GTATTCAGGG CAATTTGAAT 

165601 CTATGTTCTT GGATTGCAGT CTTCAAACTT GGCCCAAATA AACTCTCTAC TTATCTTAAA 

165661 AAAATAAAAA TTAAAAAATA AAAATAAATT CATACAGTGT TTTGATGACT ATGATATAGA 

165 721 AGAAGGGTCT TTGACTTAGG ATGAGGTGGA ATTTTTGTGT AGGAGACAGG TGCAGCTTTA 

16 5781 ACTCTTGTAT AGACGGGTTT TCATATATGT TAGTTACAAT CAAGGTCTTC CCCATTGCCC 

165841 AAGATCCTAG AAATGGGGGA AGTAAGAGTG TACTCAGGAG CTCAAGAGCA ACATCCACAA 

165901 ACAAAGATCA GGGTAGAGGT TAGAGAGGAC TCCTGAAAGA GAGAAAATTG GTAATCAGCT 

165961 TGTGGGATTT TACTGCAAGC TAGTGAATTA TATAAATATA AAGATTGGTG CAAAAGTAAT 

166021 TGTGGTTTTT GCCTTTACTT TAATGGCAAA GACCGCAATT ACTTTTGCAC AAACCTAAAT 

166081 ATTTCCATAA AAGAATGTGG CTCTGATAAT GTGGAGGTTA GTCAGCCACG GAAATAATCT 

166141 GAAAGTTTGT AGTTGCAAGT GTGTAGGTTG TTGCATTACT TGTGATGTAC TTATAAATCA 

166201 AGTATAGGCC GGGTGCAGTG GCTCACGCCT GTAATCCCAG CACTTTGGGA GGCTGAGGTG 

166261 GGTGAATCAC GAGGTCAGGA GATCAAGACC ATCCTGGCCA ACATGGTGAA ACCCCGTCTC 

166321 TACTAAAATA CAAAAAATTA GCCAGGCATG GTAGCACATG CCTGTAATCC CAGCTACTCA 

1663 81 AGAGGCTGAG GCAGGGGAAT TGCTTGAACC CGGGAGGTGG ACATTGCAGT GAGCTGAGAT 

166441 CGCACCACTA CACTCCAGCA AGACTCCATC TCAAAAAATA GTAATAATTT AAAAATAAAT 

166501 AAATAAATAA AGTATATTTC TTTCATCAGC TTCATGAGCT TGAGTAGTAT GAATTTCAAT 

166561 CTGGAGTGAT CCTGTTTTCT AAGTGTTCAC AAAGCTTGGT TTCTGTACCT GTAAAGTTGA 

166621 GAGCCAGATG CTCCACTGTG GTAAAAGTGC CAGGGTAATG AGTTGAGGCC TGCAAACCAG 

166681 GTTTATTTTG AGGTATTTAA AGTTTGAGAC CCACTCGATG CTTTTTCTAG GTAAATAGTC 

166741 ATACTAATTC TGCTTCTTCT GACTGAAGTA TCAGGAATCC CAGCCAACTA CAGTTTAAAG 

166801 ATGGAAAGAT TGGTGCTAAA TACTCATGGA TGTAAACCTG GAACCAGGGG CATAAGTACA 

166861 AATAATGGTT TCTTCCTTGG GTTTCATTTT TTCAATCTGG TTTAGTGAGA ATAAATCCTC 

166921 ATTGTGCTTT TCCTCAATCA TCCCCTATGC CTAAGCTCTA GAATGGAAAA TAGCTTGAGA 

166981 TCAATGAAGT CAGATTCTTA CTTTCCATTT AGTTATTCGC ATTG CTGTGG ACAGCTTCTG 

167041 CTCCGTACAT CTGTCTTCAA GTTGCTTCAG TTTTGTCACA GCTTTCTGGA GCTTTTCCTG 

167101 AAGGAAAAAT TTGATAAGTG AAGCCTATTC AATTTGACTC TTCATTAGGG ACCTAGGGGG 

167161 AATCCCAATC TTCTAAGATA TATTTGAATA ATAGTGAATA TTTATAGAGT CCTCATTGTT 

167221 TTTTGCTAGA GAGCATGCTA AAGGCTATAT GTGCAGGAAC ATACTGATCC CCTTGGCAAC 

167281 CCTGAATAGT TGGTAGGATT TTAAACTTCA TTTCTGTGCT GTAGAAAATG AGACTAAGAA 

167341 AGGGGTAAAA TAACTTGCCC AAAGGGCTAT GACTGCCAGG TGGTGGAGCA ACAATTGCAA 

167401 TCTCATCTGC TGACCCAGAG CCTGAGCTAT GTCCACCACT AGAGTCCTGC CAGGAAAAAG 

167461 TTGGATATAG AACAAGGTAA TCATCATCTA AAAGATTTTG TAAAACAACA TGCTGAACCA 

167521 AGCAAAACCA ATACCAGTGT TTGGCACACA TGAAATTTTG TGTCTTATGA GTCAGGAAAA 

167581 ATCAGGATGC CAGCTGGTTA TTAGAAACAG TTCATGGAAG AGGGGAATTC TGGTATCTTT 

167641 TGAACAATGG TATCATGAAT CCAATTTAAA ATGATTTAGT ATTCATGTCA AGCTTTTAGC 

167701 TTATTCTTCA AAACAGTTTC TCATATTTCT ATTGAAAGTG ATTTGAAGCT GACCCAAATT 

16 7761 GCTAATTGTA GTCAATGCTG AAAGAATTGT CTCCTGTCCT CTGTAAACCC AACAAGTATA 

167821 CTCATTCATT CTCGAGTGTT CTCAGGAAAA GGTTCTATGT AACTGTTTTA GCAAAAGATG 

167881 ACATTGTCCT T ACT AT ATG C CAAGTGCTAT TCTATGCATT CTATATTTTA ATGTCCTCAA 

167941 AGCTTATAAC CACCTCCTGT GTATGTGTTT TAGGGAGGGA GGACACTGCT ATTATCCCCA 

168001 TTTACAGATG GAGAAACCAA GGTGTGAAGA CATTAAGTAA CGTGCCCAAA ATTGCCCATC 

168061 TAGTAAGTGA CAAAACTCAA TTTCAACATA AGCTGGTTCC TTTTCTTACT ACTTGGTGGA 

168121 AAAGTAATTC AAATGGGAAT ATGATCATCG CAGTTATTAG CTGCTCCATG GAGTTTAAGG 

168181 AAGAGCTGCC ATGAGCTGAG TGGTGGTCAT GATTGACATG TCCTTAGAAG GACTTAGAGC 

168241 CTTCATACAA GACCACCTCT GCCTCATGGA GGACAGAATA AGGAGCCTGA CACTGGAGAC 

168301 AACATTTTCC TCAAATTTAG GCAGGACAGA GAAGGAAAAA GGACATCAGG ACTATGCCCA 

168361 TTCCTCCATG CTGCCAACAG CAAAGTCCCA CCTTCCTTAA TATGCTTTCT GGCAAGAAAT 

168421 CTGGATGGTA CACAAAACCT CTCCCTCTGC TTCACCTTCC ACAACCAAGC ATTTCCAAAT 
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CTTTGACTCT 

ATAGTTTGAA 

ACAGTGGTTC 

GGTTTGAGGC 

ATCCAGGTAT 

TTGCTTGAGC 

CCTGGGATAC 

TACCAGAGTG 

AGGTGTTCAA 

TAAATGTCCC 

TTTTCTTTAC 

CCCTACTGCA 

AAAAAATGCG 

AACACAAATG 

AGATAATTGA 

GTATTTCCAA 

ATAAGACTTG 

ACGTCTTCAA 

CAGATGAGCT 

GACATTTCTT 

CGGAGGCTAT 

AGTTGCTTTT 

CCACAGTTGA 

TCCATCATCT 

TGGCTCTAGA 

CCTGCACCTC 

TTCCAACCCC 

AGGCTGAGGT 

CCCTCATTAG 

TTTTAAAGTT 

ATTGTTCTTT 

GAACAAAATG 

CCTGATAAAT 

CTTTTCTCAG 

CCCAGGCTGG 

GCGATTCTCC 

CCCAGCTAAT 

GGTCTCGATC 

AGGCATGAGC 

TTGCCCAGGC 

TGGGTTTAAG 

CACCATGTCC 

ACACAAACAT 

TTAACTTAAT 

GATTGCTTAA 

GTTATCACAG 

AAAGCTGTAA 

AGAATAATTG 

GCCGAGGCGG 

AACCTCATCT 

CCCAGCTGCT 

GTGAGCCCAG 

AAAAAAAAAA 

ATGCAATTGG 



TCTTCCTGAA 
TTTTACTCCT 
ATGCCTCTAA 
CAGTATAAGC 
GGTGGGGCAT 
CCCAGAAGGT 
AG AG C AAG AC 
ATATTTTCAA 
TTTTTACGTG 
TCCCCACCAC 
TAGACTGTAG 
TGGCTTTACA 
GGTGTTTGGT 
GATAGAGATA 
GCAAGAAATC 
GTCATAATGT 
GTACTTACCA 
CAAGAGCTGT 
GCCCCTCATC 
GATCCGTCTC 
C C ATATGAAA 
GGCTTGGGTT 
TGCTTACTGG 
TCTTGGTGCT 
GATGGAGATG 
TATGTGATGA 
TATTATCTCA 
TGTTTGGGCC 
CAAGCAGTTA 
GTTTGCCAAG 
GTATTACAAA 
CTTTTAAACA 
TTTGCATACC 
TCTTCTAACT 
AGTGCAGTGA 
TGCCTCAGCC 
TTTTGTATTT 
TCTCGACCTT 
CACCGTGCCC 
TGTAGTGGAG 
CAATCCTCCT 
AGCTAAAGTC 
CTAGCTGTAT 
AAAAATTAAA 
TAGTTG CATG 
AAAGTTACCT 
AAGAGAGAAC 
TTACCAGGCC 
GCAGATCACC 
CTACTAAAAA 
CAGGAGGCTG 
ATTACACCAC 
AAAAGAATAA 
GTGATCTGTG 



TCGTGCTTAA 
TGATATTCCT 
TCCCAGCATT 
AAGAAAGGCA 
CCCTGTAGTC 
TGAGG CTGCA 
CCTACCTCAG 
TGTCACTGAC 
TCCTTCAGGA 
CAAAACCAGG 
ATACCTAAAA 
TATTGTGGTT 
TTGAGAACAA 
AGAGTCCAAC 
ATCATAAACT 
TAGGTTTCAA 
AAGCTCCCGG 
GGTGTGCCCT 
TTCGCAGAAC 
TTTGAGGGCT 
TGGAGCCCGA 
TTTAAAGAAG 
GTTCGTCATC 
GGTGGTTGAG 
AAGCAGCCAG 
GCTGGCTGCA 
TTTTGTATTG 
ACGTTTGAGA 
CAAGTGGTTG 
AATTTACATT 
TCTCGGGAAT 
TGGGGTCTTA 
TCACATAGCT 

CGCTATCTCG 
TCCCGAGTAG 
TTAGTAGAGA 
GTGATCCACC 
AGCCTCTTTT 
GGCAGTGGCA 
GCCTCACCCT 
TTCTCTCCAG 
AGCTAATACA 
ATGAAAAAAT 
TGACTAGTGG 
TGGACCAAGT 
TCAGGGAGTG 
AGCACGGTGG 
TGAGGTCAGG 
TACAAAAAGT 
AGGAAGGAGA 
TGCACTCCAG 
TTGGTACCAG 
ACAGATTCCA 



AATCTGCCCT 
TTTATCATAG 
TTGGGAGGCT 
GACCATGTCT 
CTAGCTACTT 
GTGAGCCGAG 
AAAAAAAAAA 
CCTTCATTCC 
GTTACTTCTA 
GACCTCCAGG 
GGTGATGGGT 
TTTCAAATGA 
CCTGTTCTAA 
CATCCCATTG 
ATTTTTCAGA 
GTTAAATCAT 
GCCCACACAC 
TTGTGCTGTG 
AGGTGGAACT 
TCAATGAGGC 
CACTGGGGAC 
TCTGTTATAC 
AGGCTCAGGC 
GCCATAGCTT 
AATTTTCCAC 
ACTGACTTCC 
AAGAAAAGAG 
ACTGCAACCC 
TTTAGAGGAA 
AAAATAGCAT 
ATGTAGGTAA 
ACTGAAGACC 
CAGACTGCTC 
TTTTTAATGA 
GCTCACTGCA 
TAGCTGGGTC 
TGGGGTTTCA 
CGCCTCAGCC 
TCTTTTCTTA 
TGACCACAGC 
GGCAGAGTGG 
AAAGAAGAAA 
GTAGCCACTA 
TCAGTTTTTC 
CTACATAACA 
GCTGGGAGAA 
TGAAACTCTT 
CTCACGCCTG 
AGTTTGAGAC 
TAGCTAGATG 
ATGACTTGAG 
CCTGGGTGAA 
AATTACTCTT 
TTGAAGGAGT 



CTCCTCCCTT 

ACATGCCACA 

GAGATGGGAG 

CTACAAAAAA 

GGGAGGCTGA 

ATTGCACCAT 

AAAAAAAAAA 

CCAAATGAAA 

AGATGAACCA 

CAGACATTTT 

CTTTCTTCCC 

TATTCATGGT 

AGCAAAAAGA 

AAGGTCAGGA 

AGAATGACAT 

CTCAGCTCCT 

TCACCTTGTA 

GTGCCCGCTC 

GCTCTCCGTG 

TTCCCAGCTG 

AGCAGAATGT 

ACAAGTGGCA 

AGATGGAGCA 

TTATTGAAAA 

CGTGATGAAA 

ATAGGTCTTG 

GACCTAAAAG 

AAGTG CAGAG 

AAAAAGCAGT 

AAGCTTTTGA 

TAGATGAGGC 

TATACTCCTG 

TAAATTATTT 

GACGGAGTCT 

CCTCCGCCTC 

TACAGGTGTG 

CCATGTTGGT 

TCCCAAAGTG 

TAAGACAAGT 

TCACTGCAGC 

CTGGGACTAC 

TGCATTGGAA 

TCATGAGTAG 

TGTTCCAGTT 

GCCTCAATAT 

GCAATGCAGG 

TCCTATTCTA 

TAATCCTAGC 

CAGCCTGACC 

TGGTGGTGCA 

CTCCGGAGGG 

AGAGCGAGAA 

TGTAATTAGT 

ATGGGGAGCT 



TCTTATACGG 

GTAGCTGGGC 

GG AG AC CAGG 

TAAAAAAATT 

GGTGGGAGGA 

TGTACTCCAA 

AAAGTAGAGG 

ATCCCCCAAT 

CTCTCTACCC 

TGATGGTTTG 

TGTTTTCAGG 

GTGAAACAAG 

AATTCATCAT 

TGGACAGTCT 

GATGAAAGCT 

GGGGAG CAGG 

GCCCTGGCAT 

ACAGCGCCAG 

TTCCTCACAT 

CTTGTTGGGT 

CTCCTGCCTC 

GTAGCTGTGT 

GGTGGCTTCC 

GCTCCAATAT 

ATACAC CTCA 

AAGGTTTTCC 

GAAGAAGTTG 

TTTCAAGTTG 

TTTAAAGCAG 

CTGGCTATAC 

AGCCAGTCAG 

CCTCACTTGT 

CATTATTTTT 

CACTCTGTCA 

CCGGGTTCAA 

CACCACTACG 

TGGCTAGGAT 

CCAGGATTAC 

TCTCGCTCTC 

CTCGACCTCC 

AGGTATGTGC 

TTTAGAGGAT 

GAATTTAAAT 

GCCACATTTT 

ACAACATTCT 

CTTCCTCACA 

GTTAACTTCA 

ACTTTGGGAA 

AACATGGCAA 

CACCTGTAAT 

GGAGGTTGCA 

TCTGTCTTAA 

AGTAACACTT 

TCACCCCAAT 
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AGGACCTATC 

GGAGGTAGGG 

CAGGAGCGAA 

CCCTCCCCCC 

AGTCAGGAGG 

GGGTAGGCTG 

TGAAGACGTG 

TGGGGTCACT 

GTCCTGGGCC 

AAGAAAGAGC 

GTTGGAGGGG 

TCACATATCC 

ACTTACATAT 
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AAACGTAGGA 
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CTTAAGCCTT 

AACTGCTCTG 

CACAAAATTC 

CTCCTTGCTG 

GAAAATGGGA 

CCCTGGGACC 

AGTCCTTAAT 

CACCCGTTAG 

CTTGTCACAT 

TGGCTTAAGT 

GAGGAGAAAT 

GCTGACCCTG 

CTACTCAAAA 

ACTCTTCAGC 

GATTCATCAT 

CCTGAGGCAA 

CGATGAGTCC 



CTGGTATAAT 

TTTTCCCCTA 

CCAACCCCTA 

AATCAGACAC 

CCATTTACAA 

TCCCTAGTAA 

CCCCTCTGAA 

ACATACATAT 

AAGTATTTAT 

CTGTGATTCT 

CCTTTTGTGA 

TCCTACACCA 

GAACAACAAC 

TTGTTGTTGT 

AGGAGAATAG 

ATAACGTCTT 

AACCAGAAGA 

GGAAAGAAGG 

GAGCTCCTGC 

TCAAGAGACA 

AAGGCAGAAA 

AAAGCCTGCC 

CGCCCGCCCC 

AAGTTTGAAG 

TTTTCCTCTC 

TATTCCTTGG 

GCTCTTCTGG 

CCACTCATCT 

AGGAAAGAGG 

CCCTGCTGTC 

ACTGAGAAAA 

TCGCTGCTAG 

TTAAACTGAG 

GTGACGTTGT 

AGAAAATATC 

ATCATGTGAC 

GACTCTGATT 

CCTAGCTGAT 

AAGGGTGTGG 

ATCTGAGTCA 

TCCTTCTGCA 

GTCACTAGTG 
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AATTATTATT 

TTATAAGTCT 

AGATGCAGTG 

TAAACTTGAA 

ATAGC CAATG 
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CCAGCACACA 

TTGCAGATAT 
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TCTACATAAA 

TTATCTCATT 

TTTCACAAAA 

GTTAAACTCT 

TCATTTACTG 

ATAAATATGT 

TTATACATAC 
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AGCCCATGTA 

GTCGATTTTT 

TCATGACAAT 

CAACACTGGT 

TGTTGTTTTT 

TGAATACCTC 

GCTAGAGACC 

ACCAGTTTAT 

CTGCAGAGCA 

AGAACTGGGG 

TGTTCAGAGT 

GAAGATGGGG 

TCTTCTGAGA 

CACACCCCTA 

AGTGCCTAGA 

ACAATTTGAT 

CAGGCTATTT 

GGAGATGGGG 

AAGTTCTGAA 

TGAGAGCTGT 

ACGAAATATA 

CCTTAGCCTG 

TCCCCTCTGT 

AAGTGAAACT 

TCATATCATT 

CTTCTTTTAC 

CCCAACACAG 

TGAGATCTTT 

GTTACTTCTT 

TGGAAAAAGG 

GCTTTCTATT 

GGACTCAGAT 

GCCCAGCAGT 

TTTGTGCTTT 

CAGCTTCTCC 

TCATGGGGAA 

CAGGTGTAAG 

GTCCAAGGGA 

TTCTGGGAGC 

GAACATGGAG 
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CCTGGGTGCT 

GATGGTTGCA 
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AATTAAAGGC 

GACCAGTCAC 

TTGTACTGAA 

TAATGTCTGT 

GTTCCTCCAT 
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CTTGTTAATA 

CAGTGAACTT 
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TAATAAGGTC 

GCTTTCAGGA 
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GAGGGTTTGC 

AGTTTACTTT 

GATTGCAACA 

GAGGCCAGGG 

ACCTXGCTGG 

CTCCTGGGAG 

ATAAAAAACA 

CAGTCTCTTG 

CCTCCAGTGA 

CTCCCCTCCT 

TCTTCTGAGA 

AAAACAAAGA 

TTCCCCACCC 

GACCTTTTCC 

TGCTGCCACT 

ACTGTGGGAG 

TGCACTCCGC 

AGCAATAAAA 

AGTATCTAAA 

CTACTTTTGC 

TTGCTATTTA 

GGTGGTAACA 

CTTCTCTGTC 

CTTCTTCAAT 

GAGTGCCCCC 

GTGGAGAAAA 

ACTTCAAAAT 

AAAAGATGGA 

AGGCATTTAT 

ACCAGTAAGG 

CACTGGCCTG 

TTTGGCCCAG 

ACGTTCCTTT 

CACAGAGCCT 

GACCCCATGC 

AAAGAATGAG 

CATTGTTGTG 



C CTTAGAG AT 

ACTAGACAAG 

GAAAAGAGGA 

CTCTCAGGCT 

TCATTCATCC 

AATTACCTAT 

AAACGTTATA 

ATACATATTT 

CTACCCCCAT 

AATTTGTATG 

CAGAAGGCAA 

TCCACCTCGA 

GGTTGTTTTT 

G C AG AGGTAT 

AGGGGTGCCT 

GATAATGGAA 

CCCTCTCCCT 

TCCTGAGG AG 

TACTATCTCT 

TAAAGAGTTT 

ATAGGCAACA 

GCTCTCCCTG 

CTCCTCTAGG 

GTAATTTAAC 

AAGCCACACA 

T AC AC C AGG C 

TCCAAGGCTC 

TTTGGTGTAA 

AAGTCCTGAC 

CACTTGCCAT 

GTAACCTTCA 

TCCTGGGTCA 
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CAGCAGATGC 

AAGAACAATT 

CTAAGAATGT 

CATTCATTTT 
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CATACATATT 

ATATTTATGT 

TGGCAGAGGG 

CCTTTTCTCC 

AGGGGAAGTG 

CCCCCCCCAT 

TGTTTGTGTT 

AATAGGCAAA 

AAGTGGGACT 

GCAATCAAGG 

AAACTGAGGG 

CAGTTATTTC 

TCTCCAGGAC 

GCAGACCCAA 

GAGGAGTGAC 

TACCCCCGAT 

ACAGGGGCAG 

TACAATTACC 

GAATTTCTTC 

CCCTCTCTGC 

CAGGGTTCCT 

AGTCTGGTGA 

CATTTTCAGA 

CAGTACACAC 

CTGCTCAGAC 

GGAAGTTAAC 

TAAGATTTAG 

AAGGAGGGGG 

AATTAATAAC 

CTGCAGAGGT 

CTTGGGAGTC 

GTGGTTCTAT 

TCATTGGCAT 

TCTTGTTTTT 

AGCCAGGATA 

TGTGTGGGAT 

TCCAGGGTCA 

GTACTATCAC 

ACAATAAGAG 

TAATAAATGC 

ACACAGGTGG 

TGGCCTGCCT 

TCTGGTCCAA 

CAAGCAGAAG 

TGCCACCTAC 

ACATTCTACT 

CTTGGTCCCA 

TGTAC CCAGG 
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CATGGCTCAT 
TGAATGGGGG 
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GCTGATAAAA 
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TATACCAGGA 
CCACAGCAGA 
CTTGTGCAGG 
ATGAGAACAG 
CACCAGGTCC 
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TTATTCCTTT 
CACGAGATCT 
TCAAAGGATC 
ATAATAACAG 
TCCCATTGCA 
GGGGGTGCAG 
CTGAACTATC 
TGATTATGGG 
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GAATGCCTTT 
GTTAGCAAGT 
CACTTATCAC 
TGCAAAGGGA 
TCTAAGCATA 
ATCACAAGAG 
GGGGTATGTG 
ATGCATGTGA 
ATATGTCAAA 
CTGCTTCTCT 
GGCTGTGACC 
GAGTGGAGGG 
AACTCACAAA 
AGAGGGCTTA 
AAACTCTGTT 
TTTTTGTCTC 
ATTTATTAAG 
TTTGCTCTTC 
TCCTACCAGT 
TGCGCATGTG 
CAAGTTCCCT 
CTCTGGGTAC 
AGGGAGAGTA 
TTCGGGCATG 
TGGTGGGTCT 
GATAGTTGGT 
GGG CCATATA 
GCATCACAGT 
CCTGGCTGAA 
ATCCTCCCAT 



TCCAGATCTG 
CTAGGTAAAA 
TCACAGGATG 
CAGGTTGCAG 
CTCTAGTCAT 
ATTCCCACCA 
ATAGTCTAAA 
CATATGAATA 
TTAGTCCATT 
AAAAATGTTT 
AGGCAAGGAG 
GAAATTCCTT 
CAGTATAAAT 
CTCCCACAAT 
AGTATCCTTA 
ACTTTCTTGC 
AAGAACCTTC 
AGGAAAAGGA 
CAAGACAAAA 
GGAAGGAGGG 
ATGAGACTTC 
CCCAAGGAGG 
ATATAGGAGT 
TGTACTGTGT 
CCTCAACTCC 
GTACCCCTCT 
TGAATATTTT 
TGGATTTTTT 
GTGATGTGCA 
GTGTGTGCAC 
GTACTGTGTG 
TGTGGTATGT 
TGGTACTAGA 
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TACTGGGCTG 
GCCAAGGGAA 
TTTATTAACA 
AAAGCTTATA 
GATGGGATTA 
ACAGC CAGG A 
TGAAAGGAAA 
ACAGTTGAAT 
GCATGCAGGC 
TTAAGGAAAG 
TATCTGCACA 
CATTCCCTTA 
AGTAGATCAG 
GTTACATTCT 
GGATCTTAGG 
GGCAGGGATG 
TTAGGGTATC 
GTGAAAGCAA 
GTACAATGGT 
CTAAGCATTT 



TCCTATTGTC 
TAAGGCTGAG 
AAATAGGCAG 
TAAAGAAGCT 
TCTCATTGCT 
GCTCCATAGT 
AAGGGGAGGA 
ATCCACTCCT 
TTCACACTGC 
CATGCTCTTA 
GAGCAAGTCA 
CCTATAAAAC 
TACTCAGGGA 
ATGTGGGAAT 
GTCCAGAAGC 
TTTCACTTTA 
TCTTAGGGTC 
AGCTAGTGAA 
GCAAAACGGA 
GCTGGTTCAT 
AGGAATATGT 
AATGCATTAT 
CCAAAGACTC 
GTTCCCACTG 
TTCAGATTTT 
CTCCCTCCCA 
ACTAATTTAT 
TCAATATTGT 
TTAAATCAAA 
ATATGTGCAT 
TGCTGTGTGT 
GTGTGCGTGT 
AAGTATCAGA 
GGATATACTG 
AGGAAATAAA 
ATTTCCCCTT 
TGTACACAGG 
TATTATCCTG 
CTGTTGCGGA 
AGAATTAGGC 
GCTCTCAGCA 
ACTAGGGCTT 
CTTTAGACTG 
GAATCATCCA 
AAACATCCGG 
CTGTCTGCCT 
GGAACAGAGA 
TGGTCTTACA 
TAGATAAAGA 
TCAGAGAGAC 
AATTTCTGAG 
TTTTTTGTCT 
ACGATCACAG 
CAAAGTGTTG 



AGAGGTGTTC 
ACCTACTGGG 
TTGGCACAAG 
GACCAAAACC 
CATTATACAC 
GG TTTAT AAA 
ACGCTTGGTT 
TGTTTAGTAC 
TGATCCAGAC 
CAGTCCCACG 
GGTCTTACAT 
CATCAGGTCT 
AAGACCTGCC 
TTAAGATGAG 
TGATGCTCTG 
CTGTGTAGAC 
TGGGTTGGGA 
TGCTAAAAAG 
TTGTGACAGC 
GCACAGAGTG 
TGACAAAGGC 
CTCTAATATG 
ACAATGGGAA 
TGGGCAAGAG 
GCCTCTCAAC 
AACATTTTCA 
TTTGTTTAGT 
TCACTGATGA 
GAGTAACTTT 
GATGCACGTA 
G GTATGTG AT 
C CAT AC AT AT 
ACTCATGGTG 
ATGGTTTGGA 
AACGAAAGTA 
TGGCTTCTGG 
GAGAACCATA 
GCAAAACAGA 
TTTTTGCTCC 
ATGCAGCCAT 
AAGACAAGGG 
AAGACTCAAA 
AGCTACTCCA 
CTGCAGGCAT 
TGTAAGCACT 
AAAGCAAGCT 
TTAACTTGAA 
GGAAGGGTAA 
AACTTTAATT 
TTTGAGGCTT 
CCCCAACAAG 
GTTTTTAGAG 
CTCACTGTAA 
GGATTACAGG 



AAACCAGAAT 
CTGCATTCCC 
ACACAGGTCA 
CACCAAAATC 
GAATTATAAT 
TACCATGGCG 
CTGGGAATTG 
ATAATCAAGA 
ATACCTGAGA 
TGTCTGGGGA 
GGATGGCAGC 
CATGAAACTT 
CCCATGATTC 
AGTTAGGTGG 
CCTGTAGAGT 
TTGCCCCAAA 
CCCCCTTTCT 
GAAACAAACT 
TGTCCCATCT 
GCCAATATTA 
AGGCCTAGGG 
TAAAGTTAGG 
GTAGGTCACT 
TCAGCACTCA 
TAACCCTATC 
AATCTATTTT 
GTTTGCTTCC 
ATCCCAGTAA 
CTGAATTGCA 
GTGTGGTGTG 
ACATAGTTTG 
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CTTACTGGTT 
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CTGAGTAATT 
GACCTCACAA 
AGGCAAAGAG 
ATTGACTATC 
AATTACCTCC 
GGACACAGCC 
AG C CATTCTT 
TTCTTTCTCA 
GGTAACACTA 
ACCATTACCA 
CACACCTGTT 
G AAG C AG AG A 
AGAAATCAAC 
CTTGATCCTG 
AGAGTCTCCT 
GCTATTCCTA 
CTGACCACTT 
GTTCCCATGG 
CTCATGAGAA 
CTAGAATATT 
CTAAACACAC 
GGTGTTGTGT 
TGTTAGTGTG 
GGGGATGTTA 
TCCCAGAGAG 
ATAAAAAGAA 
TGGGAAAAGA 
CTGAAAAATC 
CCACTTCCCA 
GGAAGAAGAG 
TAGGTCCGGG 
GTGGAGTAGA 
AGATTTCTGG 
CTCCACCCTG 
TTTCCTGAAC 
GCCCCCTGTG 
GTCAGAGGTT 
TTTCATTACT 
TGAAAGTCCG 
TTGCTCTTTT 
TTTTGGTAGG 
AT ATGAC CAA 
AGATGTGATA 
GCACTGTCAC 
GGTTCAAATG 
GGTACCCAGC 
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AAAATGTATA 

GAGGGCTGTG 

TTTTACAGAG 

AAGTGAGTGG 
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ACATTTG CTT 

AAATTCTGGA 

ACTGAAATAA 

ACCGGAAAGG 

AAAGAATTCG 

AGAACGGCTA 

TATTTCTTGA 

TATAAAGTAA 

CAT AG CAGTG 

GTGAGGATGA 

TCTTTACTTT 

GCACGATCTC 

CCTCCCAAGT 

TTAATAGAGA 

CATCCAGCCA 

CCTAGCCGGC 
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AGCAGGACTC 
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184681 ATAGGGGCTG GGTAAAATAA GGCTGAGGCC TGCTGGGTTA GGTTAGGCAT TCTAACCAGG 

184741 AGTTTAGTCA CAGGATGAGA TAGAAGGTTG CACAAGGTAC CCGTCACAAA GACCTTGCTG 

184 801 ATAAAATAGG TAACGGTAAA GAAGCCAGCT AAAGCCCACC AAAACCAACA TGGCCACAAA 

184861 AGTGACCTCT TGTCATCCTC ACTGCTCATA TACACTAATT ATACTGCATT AGCATGCTAC 

184 921 AAGACACTCC CACCAGTGCC ACGACAGTTT ACAAATACCA TGACAACATC TGGACGTTAC 

184 981 CTTATATGGT CTAAAACGGG GAAGAACCCT TAGTTCTGGG AATTGTCCAC CTCTTTCCTG 

185041 AAAAATTCTT GAATAATCCA TTAGTTTAGC ACATAATCCA GAAATAACTA TACGTGTGCT 

18 5101 TATTTGAGCA GTCCATACTG CTGCTCTGCC TATGGAGTAG CCATTCTTTT CTTTTATTTT 

185161 TATTTTTTAG ATAAAGACTC GCTCTGTCAC TCAGGCTGGA GTCTGGAGTG CAGTGACGTG 

185221 TTTTGGCTCA CTGCAACCTT CACCTCCCGG GTTCAAGCAA TTCTCCTGCC TCAGCCTCCC 

185281 AACTAGCTGG GACCACAGGT GGGTGCCACC ATGCCTGGCT AATTTTTGTA TTATTAGTAG 

18 5341 AGATGGGGTT TCGCCATGTT GGCCAGGCTG GTCTCGAACT CCTGGCCTCA AGCGATCCAC 

185401 TTGCCTTGGC CTCCCAAAGT G CTAAG ATT A CAGGCATTAC CCACTATGCA TGACCCATTC 

185461 TTTTATTTCT TAACTTTTTT TTGTTTTTTT GAGACAGAGT CTCACTCTGT CACCCAGGCT 

18 5521 AGAGGCTGGA GTGCAGTGGT GCGATCTTGG TTCACTGCAA CCTCTGCCTC CTGGGTTCAA 

18 5581 GCGATTCTTC TGCCTCAGTC TC CTGAGG AG CTGGGACTAC AG AC ATGTG C CACTACACCC 

185641 AGCTAATTTT GTATTTTTAG TAGAGACAGT GTCTTGCCAT GTTTGTCAGG CTTGTCTCGA 

18 57 01 ACTCCTAACC TCAAGTGGTC TGCCTGCCTC AGCCTCCCAA AGTGCTGTGA TTACAGGCAT 

185761 AAATCACTGC GCTCGGCCCT TCTTTACTTT CTTAATAAAC TTGTTTTCAC TTTACTGTAT 

18 5821 GGACTAGCCC CAAATTCCTT CTTGTGTGAG TTCCAATAAC CCTTTTGTGT GTGAAAGAAT 

18 5881 TTATGGCTGC TGTTCAGGCT GGAGCAAGCT GGAGCTCATG CTGCTGCTCA GACTGGAGCA 

18 5 941 TGCGTGATCT GTGATCCCAG TAAGAGGATC ATGGTCACTC CAGCCTGAAC GACAGCATGA 

18 60 01 TATCTCATCT GTAAGAAAAA AAAAATTACT AGAGGGCTTT AACAGCAAAT TTGAGCAGCA 

18 6 061 AAAAGAAGTA ATCAGTGAAC TCAAAGATAG GTCAATTGAA ATGATCTACT CTGAAAAACA 

186121 GAAAGAAGAC AGAATGAAGA AAAAGAAATA GAGCCTTAGA GACAGGGGAT ACCATCAAGC 

186181 ATACTAATAT ATGCATAATG GGACTCCTAG AAGGAGAAAA GTGAGAGGAC AGGGAGAGAG 

186241 AATGTTTGGA GAAATAATTT CTCAAAGCTT CCCATGTTTG GCAAAAAAAC ATTAACTTGC 

18 63 01 ATACATATTT TAGGAGCTCA ATGAATTCCA AGTAGGATAC ACTCAAAGAG ATCCATACCT 

18 6361 AG AC AC AT C A TAATCAGATT ATCAAAAGAT GAAGAAGATG AATCTTGAGA GCAGAAAGAA 

186421 AGGAACAATT CATCACATAC AAATAGTACT CAAAAGATGT CTGGAGTAGG TATACTAATA 

18 64 81 TCAGACAAAA TAAACTTTAA GATAAGCATT GTTATAATAA ATAAAGAAAG GTATTTTGTA 

186 541 ATGATAAAAG TGTCAATTCA TCAAGAAAAC ATAACATTAT AAACATACAT GCACCTAACA 

1866 01 ACAGAGCCCT AATATTCATG AAACAAAACT GACAGAATTG AAGGGAGAAA TAGAAAATTC 

186661 GACAATAATA GTTGGAGACA TCAATACCTC AC TAGTTAG A CAAGATCAAC AAAAAAATAG 

186 721 AAGACTTAAC ACTTGAAAAC ACCTAACCTG ACCCTAACAT AAATCTATAG GTCACTACAC 

186781 CCCAAAACAG CAGAATAAAC ATCCTTCTGA AGCTCACATG AAACATTTTT CAGGATAGAC 

18 6 841 TGTATATTAC TTCATGAAAT AAGTCTCAAT AAATGTAAAA GGACTATAAT AAT AG AG TAT 

18 6 901 ATATTCTCTG ACCAAAGTGG AATGAAGATA GAAATCAATA ACTAGGCTGG GCGTGATGGC 

186961 TCACGCCTGT AATCCCAGCA CTTTGGGAGG CCAAGGCGGA CAGATCACGA GGTCAGGAGT 

187021 TTGAGACCAG CCTGACCAAC ATGGTGAAAC CCTGTCTCTA CTAACAAAAT ACAAAAATTA 

18 7081 GCCAGGCCTG GTGGCATCTG CCTGTAGTCC CAGCTACTCG GGACACTGAG GCAGGAGAAT 

18 7141 CACTTGAACC CAGGAGGCAG AGATTGCAGT GAGCTGAGAT CGCGCCACTG CATTCCAGCC 

187201 TGGGAGACAG AGCGAGACTC CATCTCAAAA TTAAAAAAAA AAAAGAAACT AGAAAAATAA 

187261 GAACAAATCA AACCCAAAGC AAGCAAGAGG AAAATGAAAA ATTTCAAAGC AGCCAAGAAC 

18 7321 AAAAGGCACA TTATGTACAG AAGAACAAGT GTATAGATCA CATATTTCTC ATAGACACAA 

18 7381 TATAAGCAAA AAGACAGTGG AGCAAAATTT TTTAGATTAA TGAAAGACCT ACAATTCTGT 

18 7441 AC CAAGCAAA AAAACTCCCC CCAAATGAGG GTGAAATAAG ACAATTTAAT ACAGAGAAAA 

18 7 501 GAGGAAGGAA TTTATCTAGT CATATGTGAG AGTTTTATGA TACATTTTGT ACTGTATATG 

187561 TGGATGTTTT CTATTTCATT TAAAAAATCA ACCGTGCAAT TAAATGGTAG ATTGTCTTGC 

18 7621 TTCTTTTTGA TTGACACAGT CATTAACTAA AATATTGTAG TATTTTTTTA TCTCCCTGCC 

187681 TAAAGGCAAT AAACATCTAA TC AG CAG ACT AGAACAATAA AAAATATTTT TTAAAAGTCC 

187741 TTTAGGCAGA ATGATAAAAG TCCCTTAGGC ATATTGAAAT TCCTATTTAT ACAAAGGAAT 

18 78 01 AAACAGTACT AGAAATTGTA ACTATGTGAG TAAACAGATA ATATTTTTTC TCCATAAAAT 

187861 GTGGTTGACT ATTTTCACAA AAATAGTTAA CAATGTAATG TGTGATTTAT AGCATTTAAA 
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TGTTTTCTTG 

GTCTCACTCT 

ACCTCCTGGG 

GTGCCAGCAT 

CAGGCTGGTC 

TTTATTTTTT 

TTGGCTCACT 

GTAG CTGGGA 

ATGGGGTTTT 

CTCTGTCTCT 

TTTTAGTCTT 

AGTTATCTGT 

ATTTTTAAGT 

CTAACATTAA 

TGAACATAAA 

GGGGTGGAGA 

CTTCTGCATT 

ATTTTCCTGG 

TTTACATAAC 

TTCTCTGTCT 

CCATATCGTT 

ACATTTGATC 

CCTAAACACT 

ATTAGGCTGG 

TATTGAGGGA 

AAACAAAACA 

GAGAGAAGAA 

GGTG AAC CAT 

CAGAAAGATT 

TGAAGTTCCA 

TAAATCTGGG 



TAAAATGCTA 

TTCAAGAGAC 

ATCACGTAGG 

CCCTGACTCT 

CTTTGGACTG 

GACAACAGAG 

TACTGGCTAT 

GTTTATTGCG 

CAGACAAATG 

AGAATGCAAT 

AGATAAGCTA 

AAAAATTTTT 

TAGCAAGGAG 

TAGAAATGAG 

TGTATATTTT 

TGTTTAAGGT 

ATAAAAATAT 

AAAGAAAAAA 

GCATGCAGAT 

TGTTCAGATC 

AAGGGTGTTA 

TTTTTACTTT 

GTTATGTATT 

ATGAAATGAC 

GTTGCCCAGG 

TTCAAGCGAG 

GCCAAACTAA 

TCGAACCTCT 

TGAGACAGAG 

GCAACCTCCG 

TTACAGGCAC 

TCTATGTTGG 

GGTAACACTC 

TTTATGCTTT 

GTTTTTATAT 

CGATTCTAAC 

CATTTATTTT 

GTGTGATAAC 

AAGCATTCAA 

AAAAAATATC 

AAAATGCCAT 

CTACATAAGA 

GCTAACAGGT 

TTGCCTTTAA 

TGGTTCTTGT 

CCATTATTTT 

ACATTAGGCT 

AAAAAAATTA 

AAACATTACA 

CAAAGCAGCA 

GTTTTGGAGA 

G CATGCATAA 

GGACCATGAG 

AATGAAACAG 



ATAATACATG 

CACCCAAAAC 

ACTCCCATAA 

GCTATCAGCG 

GCTTTCAAAT 

GTTTCTCAGA 

TTATCCAAAG 

TCACTCTTCA 

GATAGAAAAT 

CTTGTCATTT 

GGATTGGAAA 

AG CTCATGGA 

GGGAGGATAG 

TTCCGGTGTT 

CAAAAAGCTA 

GATGGATATA 

CACTCTTTAT 

AGAATATGAT 

ATTGTGTATA 

TTCTGTGTCT 

AACTCTCCAA 

ATGTATTTTG 

TTGAAACTCT 

CCTTTTCTAT 

CTGGAGTACA 

TCTCCTGACT 

TTTTGTATTT 

GAC CTCAGGT 

TCTCACTCTG 

CCTCCTGGGT 

ATGCCACCAT 

CCAGGCTGGC 

TCTGTCTTAA 

CTGTTTGCAT 

TTAAGATGTT 

AATCTTTGCC 

TCTTTCCACA 

TGACATCCTT 

C AATTTG CCA 

ATTACATTTT 

AACCACGTCT 

GACACATTAT 

TTACCAAGAG 

GACAGCGTAA 

GGATGATTTT 

AAAATGTATT 

TCTCTATGGC 

TTTTCTATAT 

TTCTAAATGC 

AGCAACTCCT 

AGGAAAAGGT 

AGATCAAGGG 

CTTGGAGAGC 

TGAAGCCTCT 



TGATGTATGT 

ATATTTAACA 

CGGGAGTTTC 

TGTACTTTCA 

TCTTTTGTGC 

AACCTAAAAA 

GGAAGGAAAT 

CAAGAGCTGA 

GTGGCATATA 

GTGGCAACGT 

GATAAATACT 

TTTAGAGAAC 

GGAGAGGTTG 

CTGCACCATT 

GAAAAGAATT 

CTAATTACTC 

CCCGTATATA 

CTATCATGAT 

ATGTTCTATA 

TTACTGATAT 

ATGTGATTGT 

AAACTCTGTT 

GTTGTTAGAA 

TGTCGTTGTT 

GTGGCACAAT 

CAGCCTCCAA 

TTATTAGAGA 

GATCCGCCCA 

TCACCCAGGG 

TCAAGCAATT 

GACTGGCTAA 

AACTGACTCC 

ACTCTATTTT 

AGTGTATATA 

TCTCTTCTAG 

TTTCAATTGA 

GTACACTGGC 

ATTTCATTCC 

TAATTATAAT 

GCATGAATTA 

CTCAATTTTG 

CAAGTATATT 

ATGGCACTCT 

CTACTTCTTT 

AAATGACTCA 

CCTTTATGCC 

AGACATTAGG 

AAGTTTCCAG 

TGTAACAAGA 

GGAAGGACCA 

CACCAAGAGA 

TAATAAAAAA 

ATGAAGTACA 

GGCAGAACTC 



ACTAGCGTGT 

ACAATGCCCA 

TTCAGTGTCA 

CCTTG CAAT A 

AGGGAATTCA 

TAGATCTACC 

CAGTATACAA 

TATATAGAGT 

TACACAATGA 

AGATGAAACT 

ACATGTTATC 

AGAACTGTGG 

GTTAATGGTG 

GTAGGGTGCA 

TTGAATACTC 

TGATTTGATT 

TGTACAGTTA 

GTATATATCA 

AATCAATTAG 

TTTGTCTAGT 

AGAATTGTCT 

ATGACATTTT 

TCATACATTT 

TTTGTTTTTT 

CTTGGTTCAC 

GTAGCTGGGA 

CAGAGTTTCA 

CCTCGGCATT 

TAGAATGCGG 

CCCATGCCTC 

TTTTTGTATT 

TTTAACAATA 

AGCTGTTATT 

TTTTAATATG 

CCAACGTGTT 

AATATTTACA 

TAGCATCTCC 

TACTCTGAGT 

TCTTTTTGTT 

TTAGGAGAAA 

TTTCCATCTT 

TTACATGGCT 

TGTATTTCTG 

CACCAGTATT 

AGCTAATAAT 

CACAATAAAC 

CTGGACCCTA 

AAAG CCAAG A 

TAAGAAAAAG 

CTGCTGCAGA 

AGGAGGGGGT 

AATTCCGTAT 

GGAGGAGGGT 

ACATCTCTTT 



ATGG CAAT AT 

TTCCCACCCC 

ATTGGTGCTG 

AACTCCTTTG 

AGAATCTGAA 

AGATGAGGCT 

AG AG ACAC CT 

CAACCCTAAA 

AATACTATTT 

GGAGAACATT 

ACTCATATGT 

GTACCGGAAG 

ACAAAATTAC 

TATGGTTAAC 

ACAACAAAAT 

ATTACACATT 

TTATATGTCA 

TGTGTACTTG 

CTCAAGATAA 

TATTGCATCA 

ATTTTGTCTT 

GCTATGTATT 

ATGATTATTA 

CTGAAATGGA 

TGCAACCTCC 

TTACAGGCAT 

CCACGTTGGC 

TTTATTTTAT 

TGGTGTGATC 

AGCCTCCCGA 

TTTAGTAGAG 

CAAAATATCA 

ATTATAGCCA 

TTTATTCTCA 

TGGTTCTTGC 

CCATTAACAT 

CATATAATAT 

GGAAAGGGCA 

ACACTGTTTT 

ATATTTTCCA 

TCTTCCACAT 

TCTCAGTGTC 

GTGGCTATGT 

AAAGACATGT 

CCTAATTTTA 

ATTTATTGAC 

GCCATATATC 

TGTGTTTTAA 

TGTTGAGGCT 

GGTAATAACT 

CCAGGGTGTT 

TATGTAAATG 

TGGTTTCAAA 

CCTCCCCTCT 
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TCCTTGCACA 
GAGCCTAGGA 
CTAGGTTCTT 
AGCCTAAGCT 
ATGGCCTAAA 
AACCTTTCTT 
GGGCCATAGA 
GACGACCACT 
TTTGAGCACT 
ATCTACAAAA 
AAGGCCAGGA 
GTGATCCTGG 
TTTGAATTTG 
TTCAAATGTT 
GATGGATAGG 
AGTGTTCTTT 
AGGTTTTTGG 
CCAGGCTGTA 
GTAATTCCCC 
CAGCTAATTT 
AGTCTCAAAC 
ATAGGCATGA 
TTAATTTTGA 
GGCTCACTGC 
AGCTGAGATT 
GGGGTTTCAG 
CTCGGCCTCC 
CAGTCCCACT 
CTATGAGATA 
TGTGTGCTCT 
CACACACACA 
ATCCCAAGGG 
ATTATTCTTT 
TAGAGTTACA 
CTTTTATCCA 
TAAAACACAA 
AAAGTTATAC 
TGGTAATACT 
TCCTATCTGC 
GTCTCCAGAA 
CTCTCATTCA 
TATCTATATG 
CCTTCTAAAA 
GGATTAGGGA 
TCATGAGGTG 
GAGTGAGAGT 
TGTCCATCCA 
TCATTTGTCT 
CTGAGCCTCA 
AGGTTCCTTC 
GTAAACCTTG 
AATCTTGTAT 
TGGCTGAGTT 
TACAACCTCA 



TTCCCTTTAT 
AGTGCTAGGG 
CTAGGAGAGC 
CACCTCCCAA 
ACCCTTCCAT 
CCTCTAAGTC 
CGTGCTACCA 
GGCCTTTGAA 
GCTGCCAAGA 
CACCTAACCT 
AACTGTTCCA 
ATTGAGGGGA 
AATTTAAAGA 
TCAGTAAGTA 
TAGAGAAAAA 
GTACTGTTTT 
GGTTTTTTTG 
GCTCAGTGGC 
CTACCTCAGG 
TTATTTTTTA 
TCCTGCCCCC 
GCCACTGCAC 
GTCAGAGTTT 
AAACTCTGCC 
ACAGGTGCCT 
CATGTTGGTC 
GAAAGTGTTG 
CTACCTTGTC 
GAGGAATCCA 
CATGTGCTCT 
CACACACATG 
TTTTGTGTTG 
TTCTCTTTTT 
GTGCCTCTAT 
GTTCAAAATA 
AATTTTATTT 
ACACACAAAC 
TCATCAGGCA 
TTCCTTCAGC 
CTTCCATTCA 
TCTCTATTCC 
AAGTCTGCGA 
TTCTTATTAC 
TCATGATCTC 
GAGGTTGCAG 
CTGTCTCAAC 
AAATTGAGAA 
GGTGTGGTGG 
CTCCTGAGAA 
CTGGATAAAA 
GATACTGGGA 
TTCATTAAGA 
CTTTTAGAAC 
GCTAAAGGAT 



GGAGTAATTG 
TAAAGTGGAG 
CCTTCCCCAT 
AGACCCCTTA 
AACTCTATAG 
TGCCACCCTA 
AGTCTCCAGA 
CCAGACCCTT 
CATCTTTGGC 
TTAAAAATTC 
GGTTAATAGA 
AAAAGTGTTG 
TAAAGTATTG 
TATATATATA 
G C AAATGTAT 
TCTGATTTTT 
TTTGTTTTTT 
CCAATCATTG 
CTCATGAGTA 
AATTTTTGTA 
AAGTGATCCT 
CCAGCCCCAA 
CACCCTTGTC 
TCCTGTGTTT 
GCCACCATGC 
AAGCTTGTCT 
ATGAGCCACC 
CTACACTACC 
AGGAAGAAGA 
CTCTCTCTCT 
AATACCAGAG 
TAGTGGTTTG 
GCAGCTGAAG 
TCAGGCTTCA 
ATGCATTCTC 
ATGCTGAACA 
ACATTTGCTC 
TGAGTAGTAC 
ATTCTCCAGT 
CATTTAGAAG 
TTCTTCTAGC 
ATGGTTCTCA 
CCAGGGCATA 
TGGAGTCTGG 
TGAG CCGAGA 
AACACAAAAC 
CCATTAGGTA 
CAGCTTTTTG 
CACTGGTGTG 
ACCACTGACC 
AGCCTACAGT 
CTAATATTTG 
TTTTGCATTG 
TAAAAGACAC 



CAGGGATGGG 
AATGAACCTG 
AAAATCTGCC 
CTTGCTGACT 
CCAAATTCAA 
GGCAATTCTC 
CCTAGACCTG 
CTCTGTGGCT 
ACTTTGTTGT 
ATTGTCATTT 
GACTAAAGAG 
T C AG AG ACAT 
AGTAATATAG 
TAAAGAGATA 
AATATTAACA 
CTATATGTTT 
GTTTTTAGAG 
CTCACTGCAG 
GCTGGTACTT 
GAGATGGCAT 
CCCACTTTGG 
ATAAAAAAGT 
ACCCAGGCTG 
AAGCGATTCT 
CCAGCTAATT 
CAAACTCCTG 
ACACCCGGTC 
AGGGGCTAGG 
TAAGCTACTT 
CTCTCTCTCA 
CTATCACTTT 
CTCATTTGTT 
GGAGAATTTC 
TAGAGAGACC 
ACCAAGATGT 
TTGAATCACT 
CTGCTTTGTT 
GTCTTGGAAG 
GTATCTGTCA 
AGGGCAGCGG 
TATGGTCCAG 
GACTGGTTGA 
TCTCAGAATG 
TTTAGGCACT 
TGGCGCCACT 
AAAAAAAACC 
AGGCCAAGCT 
ATAAGGGAAG 
TATGTTGCTA 
CTGGGAATGT 
TGAAAATATT 
GTACAGTGCA 
AAATAGGTTC 
GTGAGCTGGG 



AAAAGTTCAA 
CGTGATTTGC 
CTCCTCGAAG 
GAATCTGATT 
TTTTAGACAG 
AACATTCTCT 
ATGGAGCAGT 
CCTATGCATC 
GAAGTTTTAA 
CATATCATGA 
ATAGCAACCA 
GATTGGGACA 
GAAGATGATT 
TAAAGACATA 
ATCTAGGTAA 
GAAATCATTT 
ACAGCATCTT 
CCTCAACTTC 
CAGGTGTGCA 
GTTGCTATGT 
CCTCCCAAAG 
ATTTTATTTT 
GAGTGCAATG 
CTTGCCTCAG 
TTTATATTTT 
ACCTCAGGTG 
TAAAAAGTAT 
ATCACCCCAT 
GGTTCCTCTA 
CACACACACA 
CCCAGTCTAG 
TGTTTTGTTT 
CAGGCCAGCC 
TGGGATTCAG 
ACTTTGAAAT 
TTTTTCTGTA 
TATTGGCCCA 
GTGTGGTCTA 
TCTGTCTACC 
CTTTCTATGG 
CTCAGCTGTT 
ACATTAGAAT 
AGTACCACAG 
AGTGCTGTTT 
GCACTCCAAC 
AACTACCCTT 
GTATAATTAA 
TATTGTTGCC 
AAATTCCCCA 
ACCCACTGCC 
GGGCTTGAGA 
G CAAATCAAG 
AAGCAGCAAT 
TAGGATGAGG 



AACCACCACT 
TCATCCTAAA 
GGGCCCAGAC 
CCACCCAGAC 
GCCTCATACC 
ACACACTTTG 
GCTGTAATGA 
TCCAACCTGT 
AACTGAACTA 
AAGATAAAGA 
AATGCAATTT 
GCTGGTAAAA 
ATCTGCAACT 
TAAATAAATA 
AAAGTATATG 
TAAAATAAGA 
ATTCTGTCAC 
CTGGGCTCCA 
CCACTGCACT 
CACCCAGGCT 
TGCTAGAATT 
AATTAACTAA 
GCATGATGTT 
ACTCCTGAGT 
TAGTAGAGAC 
ATCCACCCAC 
TTTAAAACCA 
GTCTTCTAGG 
TAGGGTCTTG 
CACACACACA 
TACTCATCTC 
GTTTGCTTGG 
CTTTGGCCAT 
TAGTGGGGGG 
AAAACAATAC 
TTTTGTGTAG 
GGGGTATGTT 
AAGCCTAGAC 
TTAGGATGGG 
AAAATATGAA 
TGGAATAAAG 
CACCTGAGTA 
GGTAGGGATA 
AAAACTACGT 
CTGGGCGACA 
GTGATTTGAA 
AG AG CAGTTT 
ATC C AC AT AC 
GGTGATTCTG 
AATCTCCTGC 
TCCTGAAACA 
GGAATTTTGG 
AAGTTAAAAC 
TCTAAGATTG 
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GGTGTGGCGG 

TGAGGTCAGG 

TACAAAAAAA 

GAGGGAGGAC 

CTGCACTCCA 

CAATAATAAT 

CTTGAGATTC 

TATGATGAAT 

GACTTCCAAC 

TGTTAGATGG 

CACTTCTAAC 

CACTCCTGCT 

TCTTTAACTA 

TCAAGTGTAT 

TCTGTTTTTT 

AGTTATATAT 

ATTTATTATT 

CTTCAAGCCT 

TTGATTGTGG 

GCACCGGTAG 

TGTCACCCAG 

AATCAATCTG 

ACACCTGCCT 

CTGGTCTGGA 

ATAGGTATAA 

TTTATGTCTG 

TCCATCTATT 

GTGTCTCTCT 

CTTTGGTTGA 

CAGATGTGAG 

GTCTAGGTAG 

GGGTCTCAGC 

GGAGAACTCC 

ATGTCTTGGG 

TTTTGTGACC 

TGATGTGAAG 

GTCTCCTGCT 

GTAGCACTTA 

ACTAAACAGT 

C CC ATCATTT 

CACAAGAAAA 

TGTACTATTT 

AAAAATGAAA 

ATTTTTAAAA 

ATTTTTCATG 

GTTTTGAGGA 

ACCGTCATGA 

TTTTGGTTTG 

ATACAGTAAA 

GAAAGTCAGA 

CTCTATCTCT 

GTATGTATAT 

CACTACAGTA 

CTTGTATATA 



CTCATACCTG 
AGTTCAAAAC 
TTAGCTGGGC 
AATCACTTGA 
GCCTGGGTGA 
AATTCAGACA 
AAGTCACACA 
GGAAATTTTT 
AGCAATAACA 
ATAAAGAGAT 
CATCAGATAT 
TAAGACATTT 
CAGGGTTGGT 
GCCATGTACA 
CCTTCATTGA 
AATTGTTACA 
ACCTAAGGTC 
CATGTGGCTG 
AAATTTCCAC 
AAGCTTCTTG 
GGTGGAGTGC 
TTCTCCCACC 
AAAAAACAAA 
ACTCCTGCGC 
GCCACCATAC 
TCTTCCATGG 
GATTAGATAA 
TTATCTTAAA 
TCCTTCTTAA 
TCTATGGGAA 
AAATCAGTCA 
CAAGGTCTTG 
CTTGGAATAT 
AATCTTGGTC 
AGATAGTAAA 
CTTCTGTGGT 
TGGGAACAAA 
CTTTTCAATT 
CTAAATATAA 
AATTTTTTCT 
CCTGGCATAT 
CATGTATTCT 
ATTTTGCATT 
TGTTTCCCGA 
TGACCTCAGT 
AATATAGGAA 
CCTGGTCCTG 
GATGCTTTGT 
TAAATGCTAT 
TTCATCTAAA 
CTCTCTCAAT 
ATCTGTTTCT 
CTAGCATTTT 
TACACACACA 



TAATCCCAGC 
CAGCCTGGCC 
GAGGTG CC AG 
ACTCAGGAGG 
CAGAGCAAGA 
TATCCAGGCA 
TGAAATTTAG 
CAAAGAGGAA 
CAGGATTAAT 
AAAAGTACTC 
AACTAGCAGA 
TAATTAC TCT 
CTGGGTGTGC 
GGTATTCTTT 
AGTCAATGGC 
AAACAAATTA 
TGTGGATAGA 
CAATCCAGGT 
TTCCAAGCTC 
GTAGAGGCTG 
AGTGGAGCAA 
TCAGCATCCT 
CAAACGAAAA 
TCAAGCAATT 
CTGGCATATG 
TATTCTAGGT 
AACGTTGTTC 
ATTCTAACCA 
CCTCTTCTTG 
AG C AAGCAAG 
TGGCCCTTCC 
TGGCCTAAGC 
CTTTTTTTGT 
TAGAGCCATT 
TAAGTTCTAT 
TCAGCCCTTA 
AGTCTGGCTT 
AGGAGTGTCC 
AAATCATGTC 
ACTGGGTTAT 
ACATGGATTC 
TTTTCACATC 
TGACTAAATT 
AGTTTTGAGT 
GCACTGCTGT 
CGACAAGATA 
TTGACTTCCC 
GTTTTCCTAA 
TTGTGTGAAA 
AATCCTTGTG 
GTATATTTGG 
ATTCCTGTAT 
TCTAATGTAA 
TACACATACA 



ACTTTGGGAG 
AACATGGTGA 
GCACCTGTAA 
CAGAGGTTGT 
CTCCATTTAA 
TCAAACAGAT 
GTGGAAAATG 
TTTCAGGCTC 
GAGGACTTGG 
TCTCTAAGAA 
CTAAACGGTC 
CAGTAACTCT 
AACACAAGAA 
CATGTACTAT 
TGATATTAGA 
GCAAAAACTT 
AGTTCTGACA 
GTTGGCTGAG 
CCTCAGGTTT 
ATTCAACTTC 
TCATAGCTCA 
GAGTAGCTGG 
AAAACCCCCA 
CTCCTGCCTT 
GCAAGTCTTG 
TATTGTTGAG 
CTTCTGTTAT 
AAGAGCTGCT 
CCCTCTGGGG 
AGGTTCTTCA 
AATGTGGTAC 
CTTATAGAAA 
GAACCTGAGG 
TCAACCTGAT 
GATGTTCACT 
CTTCATCTTC 
CATTCTATGA 
TCACTTCTAT 
CTACTCCTGC 
CTTTAACTTC 
AAGTGTATGC 
TGTTTTTTCC 
TGTCAAATTT 
GAAGTTAGTA 
GCATTTCCAT 
AAGTTCAAGC 
TAGATTTCCC 
AATCAAAATA 
CTTTAAACAA 
GCCAGAATTA 
TGTAGGTATA 
GTGGATGTGC 
TTCAATATTG 
TGCATGTATG 



ACTGAGGTGG 

AAACCCATCT 

TCCCAGCTAC 

AGTGAG CTGA 

AAAAATAATA 

ACCTGGGGCA 

ACATTGGAGA 

TGTTCTTGAG 

GATGTTACAT 

CATGGGACCA 

TAAAAATAAA 

TCAGTTTTTC 

AGCCTGGCAT 

TTCATGTATT 

TTCTACTATT 

AGTGG CTTAA 

TGGCTTAACT 

TCTGAATTCT 

GTTGAAAAAT 

TAG AGG CTGT 

CTGCAGCCTT 

GACCACAAGT 

GAGAACTTTG 

AGCCTAAAAG 

AGCAGGACAA 

ATGGTCCTCT 

TTTTCAACAG 

CTTTTCTTGG 

CCTAAGATGA 

GCCTCCGTTC 

AGACCAGATC 

TAATGAGTGT 

CAACTTTTGG 

TTCTTTTCAT 

CAGAGAAATA 

ATTCCCTCTT 

CCCCCACGTT 

CCATCAGACA 

TGAAAACATT 

AGAGTTGGTC 

CACGTGCATG 

TCTAAAATTT 

AGTCAAATTT 

CTTCAGAAAA 

TTCTGCGTCC 

TCCTGGACAT 

GCTATTTCCT 

GGTTTTTGCC 

TACAAAAAAA 

ACTACCTTAG 

GGGGTGTGTG 

AC AACG CATC 

TTGAAAACAT 

TACATATACA 



GTGGATCACT 

CTACTAAGAA 

TGGGGAGGCT 

GATCGCACCA 

ATAATAATAA 

GATGAATAGT 

AATTTGAGAT 

GGGATAGATG 

AAATTAGAGA 

GAGATAGGCT 

AATCATGCCC 

TACTGTGTTA 

ATACATGGAT 

CTTTTTCACA 

CATGTGTACT 

AGCAACACAC 

GGGTTCCCTG 

CATCAGAGGC 

TCAGTTCTTT 

CTGCAGTTCC 

GACCTCCCAG 

GTGTGCCATC 

TAGAGACAAG 

TTCTGGGATT 

ATACAGATGA 

ATTGTCTTGT 

TAGCTTTTAT 

TGTACTTTAC 

GGGCTGTTAT 

AGCCTTAAAT 

ACAGAGACAG 

TTACTTACTT 

TGATTTCTTG 

GTCAGTGGCA 

CAATGACTTA 

ATCTGCATCT 

GAGTTTCTTA 

TAACTAGCCG 

TTAATTACTC 

TTGTGTGCAA 

TATTCCTTCA 

ATTTCCTTTT 

GTTTAAAACC 

ACTGTTTTGT 

ACACACATTT 

TGCATAAAAG 

AAGTTGAGAT 

TTTTATGATT 

ACCTAAGGAA 

TTATTATTTT 

TAGTGTGTGT 

CTGCTTTGTA 

TTTAAAAAAG 

CATACAGACA 
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AAAATGTATC 

ATCCATAGTT 

TTTTTTTTTT 

CTCACCTCAC 

AGTAGCTGGT 

GAGACGAGGT 

CCTGCCTCAG 

CTTACACATC 

GCTATTATGG 

TTAATATTCA 

AAAAATGTAT 

TTTTAACTGT 

ATGAAAATTT 

ACTCAATGTG 

TCAAGAATAT 

GTTTTCTGCC 

CATATAATTC 

TTTCTATTCC 

ATCTTTCCAG 

TTGAACACTT 

TCCCCTGTAT 

TGGGAAGCCT 

ACCTGTGCAA 

GTGTGGTCTC 

ATAAATTAAT 

AAATATAGTA 

CCGAGGCAGG 

ACCTCATCTC 

C AG CTACTCA 

GAGCTGAGAT 

ATAGTAATAA 

TTTTTAAAAA 

TACATAGGTA 

TAAGTACAGC 

CCCCAACAGG 

GCTCCCACTC 

ATGTCAGGCC 

TTGATGTTTA 

ATTTAACTGA 

GCTACAGAGA 

TAATGTTTAT 

AACTGGGTGA 

TAAGATTAAG 

AGTCCTAACA 

ATCCTAGTGA 

AATATTATCA 

GATCATAGCA 

AAAATGAACA 

AACAAATTAT 

GGCCACTCTT 

ACTTTTTCTA 

CATACCTTTT 

TCTTTCTCTC 

TCTTTCTTTC 



CTATGTATAT 

TATAAATGTT 

TTGAGACGGA 

TGCAACCTCG 

ACTACAGGTG 

TTCACCATGT 

ATTCCCAAAG 

TTTTTTTTAC 

CTCACACAAT 

GAAAAAAATG 

TACATTATAT 

GTTATTTTTC 

GGGCTATTTC 

TTTACTAATT 

AGCATATGGT 

CTTTAATAAA 

TTAAAAAATG 

TGTCTAGTTA 

TCAGTTCACC 

CTTATTCCTC 

TACTGACTTA 

TCCCCTGATA 

GTTTGTATTG 

ATTCAGTGGA 

ATTGTCGGAA 

TGTTGGCTGG 

AGGATCACAT 

TACTAAAAAT 

GGAGGCTGAA 

CATGCCACTG 

TAATAATAAT 

ATTATTATTT 

AACGTGTGCC 

ATGCATTAGC 

CCCCAGTGAG 

ATAAGTGAGA 

AGAGAGGCTT 

TAAATGTTAC 

GTTAACTTTG 

TCTTGATTGT 

TAAGCGTGTA 

CTTATAAACA 

GCCCTGGCAA 

TGGCAGAAGG 

TGAGGTTTCT 

CTTTGTGGGT 

CACACCATAG 

AGATCCCCTC 

GCAGTTTGAA 

TCCTTATTAT 

CCAGAACTGG 

GGGTAAATTA 

TTTCTCTTTC 

TTTCTTTCTT 



TCACACATGT 

GCTTTTTTTT 

GTTTTGTTGT 

ACCTCCCGGG 

TGCGCCACCA 

TGGCCAAGCT 

TGCTGGGATT 

CACTAAACTG 

CTTATGTAAA 

GAGTCCCTTT 

GAAAGTATTC 

TGCAATGACT 

CAGTTTGTCT 

TAGGAAGAAT 

ATAGTATAGT 

ATTCTATCAT 

TATAGTTTTC 

ATTCTACTAT 

ACATTTCCTT 

ACACCAAGCC 

TTCATACCAC 

CCCCTAGTTG 

TTCATGTTTA 

CTCTGAACTC 

GCTAATGTCA 

GCGCAGTGGA 

GAGGTCAGAA 

ACAAAAAGTA 

GCGGGAGGAT 

CACTCCAGCC 

AATTGTATGG 

TTTAAGTTCC 

ATGGTGATTT 

TCTTTTACCT 

TGTTGTTCCC 

ACATGAGGTG 

AAATTTTTAA 

AACTTCTTTA 

TTATATGAAA 

TGGTGGTGAA 

CTGTCTTAGT 

ACAAAAAATT 

ATTTAGTGTC 

GTTGAATAAA 

GCCCTCATGG 

TAGGATTTCA 

GACAGACACT 

AG AG AG CTTG 

AAATCTACTC 

ATTAAATATT 

TATCAGCTCA 

AGCCAAGAAA 

TTTCTTTCTC 

TCTTTCTTTC 



ATACACACTC 

GGTCACCTTT 

CATTGCCCAG 

TTCAAGCGGT 

TGCCTGGCTA 

GGTCTGGAAC 

ACAGATGTGA 

TTTACCCAAA 

CAAAGATACA 

ATACGTCCTT 

AGGTCAAATG 

ATGTATCACA 

TCCATTTTTC 

CAATAGTTTT 

AGAGTACTTA 

TTTCTGCCTT 

ATTCTAAGGG 

TGGGAAAAGT 

TATACCTTTG 

TCAACGGGTT 

ACATGGAGAC 

GCAGGAGTCT 

TCATCCTTCA 

TTATGAAGTC 

TGTCTAGAAT 

TCAAGCCCGT 

ATTCAAGACC 

GCCAGGCGTG 

CACTTGAACC 

TGGGCGACAG 

AATTGAACTG 

TGGGTACATG 

GCTGCACCTA 

AATGTTCTCC 

CTCCCTGTGT 

TTTGGTTTTC 

GGATCTCTGG 

ATTTCATTAA 

ATCATGATTG 

GCAATGCAAG 

CTGTTCAGAC 

TATTTCTTAC 

TGGTGAGGAC 

CTTCCTTGGG 

TATAACTACT 

ACATGAGTTT 

GTGCCAAGAA 

C AAAATC C AG 

TGAATCTTAC 

TACTCTTGTT 

TGCTCTGCCT 

GTTCTCCTTT 

TCTCTTTCTT 

TTTTTCTTTC 



ACACGTACAT 

TTGCTAAGTC 

GCTTAGTGCA 

TCTCCTGCCT 

ATTTTTGTAG 
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GCCACTGCAC 

CCTGATAACC 

GATATATAGA 
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TTATAGATGT 

AAGTACTCAG 

TTTCTTCCTC 
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GTTTAATTTA 

TGAGTCACAT 

AACATAAAAA 
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TACTTTAATC 
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CTCTGATTGG 

TACAGGATGT 

TCAACCCATC 

CACACCCCCA 
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TGTTCCTGCC 

ACTTTTCTTC 

ATGTATACCT 

GGAGTGAGGG 

AATTCATTCA 

TGCTGTAACA 

AGTTCTGGAG 

AGGTAGCCAT 

TTTCTTTTAT 

GCCCAAAGAC 

TGAGAGGATA 

TTGTGGATAT 

CTATAAAATT 

TTGTGGCATT 

TGGGGGATCC 

TATGCAAATT 

CTTCTCTTTC 

TCTTTCTTTC 

TTTCTTTCTT 



AGAGTTTTAC 

TTACACTTTT 

GTAG CGCGAT 

TAGCCTCCTG 

TTTTTTTATA 

AAGTGATCTG 

CCGGCCAAGT 

CAAGTCAACA 

ATTTTCTTGA 

TTACTCATTT 

GATTCATTCT 

TCTTCCACTG 

TTGGATTTTC 

CTTCTCCCAT 

GCCAGATCCT 

TTTCCTTGTT 

CTTCTTTCCA 

AAAAATTCTT 

CCCAGTCATG 

GGAAGGTGCT 

CTGTTCTGCC 

TTTTCTAGTC 

CTGTCTCTAT 

TCAGATCTTA 

TATCAAAAAA 

CTTTGGGAGG 

AAATGGTGAA 

CCTGTAATCC 

AGATTGCAAT 

ACTCAAAATA 

AAATAGCTGT 

GCAGGTTTGT 

ACCTAGGTAT 

CCCCATCCTC 

TCATTGTTCA 

TTAG CTGTTA 

TACATTACTC 

TATTGAGTTG 

GGTTAAACCA 

TTCAGTAAAC 

AAATATCATA 

GTGGGAAGTC 

CTTTTTGCTG 

AAGGACACTA 

CCCTCCTTCT 

CAGACATTTG 

AGTGATTCTC 

ATG CTTTTTA 

GAATACTTTC 

AGTCTCACCT 

AAGAAAATAT 

TCTCTTTCTT 

TTTCTTTCTT 

TCTTTCTTTC 
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TTTTGGACTT 
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GGAGTTGAGT 

AGGATGGCGG 
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ATTAATGATT 
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TAGCTGACTT 
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CAGTTCAAAG 
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AACCTGTTGC 

GTATTTGAAA 

TAGGGCTTAA 

GGATTATTAC 

GAGAAAAGAC 

ATGAAGAGAC 

AAGCCACTGG 

ATTAACTGTG 

GAACTGACAA 

ATTTGAAATG 

ATCCCAGCAC 

GCCTGGCCAA 

CGGCGCGTGC 

GGAGAATTGC 

CCAGCCTAGG 
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CACCTAGCTT 

GCACCAATTT 

AAGGAATTGC 

CAGAAGAATC 

CTTTTATAGA 

ACTGCCAATA 

ACATATAACA 

CCCAGTGAAG 

TGAAACTTTA 

TCCTCCCACT 



TGACAGGGTC 

AGCCTTGAAC 

ACAACATCAA 

GCTGGTCTTG 

ATTACAGGCA 

CATGATAACT 

TCAACTCTTC 

GATATCCATG 

TTTACATGTT 

GGGCCCTAGT 

AAAAATACTT 
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TGGTAATTTA 
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CGTCTGACTG 

GGGAAGAAGG 

AATACTGGTT 

TAACTTACCA 

ATTAACAGCA 

TCTCTGAAAA 

TACCAATGGA 

ATGCTGTGAT 

GATTTGTGTT 

GTCTCTCAGG 

TTTGGCAGGC 

CATGGTGAAA 

ACCTATGCGC 

TTGAACCCAG 

TGACAGAGTA 

CAAAGTTATG 

GTTGCCCTCA 

CTCAGAAAGA 

TTCTATGTTT 

CACTTTAAAA 

AATACATTTC 

TTGTGACTGT 

CTTTGCAATT 

TAACATTATC 

CCTGGTTTAC 

GCAGGTTCAT 



TTGCTCTATT 
TCCAGGGCTC 
GCTAATTTTT 
GATTCCTGGG 
TGAGCCACTG 
ATAATAAATC 
CTGCTACCCC 
TCCTAATCCC 
TAATAAGTTA 
GTAATCACAA 
CAAGATGTTA 
GTCACTACAA 
CAACCTTGCC 
GAACTGTAAA 
CTACAAC AG C 
TACTCCAGCA 
AGGAAGTCTG 
CGTCACACTA 
CACAATTCCA 
AAACAGGAAG 
CATTCATTAG 
CTGAAGAGGA 
ATTTGTTCAA 
AATTCAAAGA 
GTCTGACTCC 
TTTCAGAAAA 
CGATGACATC 
CGGCAAAACC 
CCACGTGGCT 
GACAGAAAAC 
TAATGGTGAA 
AAGGTCCATT 
ACATAAGGAT 
AGATAACTGA 
CACTGTGGAC 
TAGTCAGACT 
TGGGACTGTG 
AAAAGGAGAA 
TGAGGCGGGC 
CCCTGTCTCC 
ATGCATAGTG 
GATGTAGAGG 
AGACTCTGTC 
AATAAGACTG 
TTGTACAGCT 
TAGGCAGATG 
TCTGGTGAAC 
AAGAAACATT 
ATTTGCTGTT 
TTATTATTAT 
TACGAGGCAT 
CCAATTCTTC 
TCAATTTGGG 
GCTCTTTGAT 



GCCTAGGCTG 
AAGCAATCCT 
GCATTTTTTT 
CTTATGCGAT 
CCCCTGGCCA 
AATAACCAGA 
TCTATCCCTC 
CAGAACCTGT 
AGAATTTTGA 
GGGTCCTTAT 
CACTGCTGGC 
GCTGAAAAGA 
AATACCTTGA 
TAAATAAATA 
AATAAAATAG 
GGTTATTTCG 
AAGCTGATAA 
GGAAGATCCA 
AGAGTCGGGA 
GAGGGGTACA 
GCTGTGGGCA 
ATGAACACCT 
TAACCCCATG 
GGTTCAGTGA 
ATCACTGATT 
AAAGGCTTGT 
TTTTGAGATA 
AAGCCGAGGC 
CACTATTTAT 
TCCCTCCTTT 
GGTAAGTGCT 
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ACTAAAAATA 
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AATGGCAGAG 
CTAGGTGTAA 



GAGTGCAGTG 

CCTGAGTAGC 

GTGGAGACGG 

TCTCCTGCCT 

TTATAACTAT 

ATTTTTAAAT 

AAAAGGGTAG 

AAATATATTA 

GATGGGCAGA 

AAGAGACAGG 

TTTAAGGTGG 

AAAAGAAATG 

TTTTGGCTCA 

ATTTTGTGTT 

AATTAAATAC 

GGAGTATGGT 
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AGGCAGCTTC 
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CCTCAGTTTT 

TCATCTTGAG 

TTAACAACCA 

TATGGATCTG 

AGTATCTAGA 

TTTGTTTTCA 

GTGAAAGCCC 

CAGAATTCAG 

CATTTACTCT 



GTGCAATCTC 

TGGGACTATA 

GATCTCCCTA 

CAGCCTCCCA 

TTTCATTGGC 

AAAGAAAGGA 

GCTGAATGTT 

CCTTATATGA 

TTTTCCTGAA 

CAGAAGAGTC 

AGGAAAGGCC 

GATTTTCCCC 

GTGAAAC C C A 

GTTTCAAGCC 

AGAGATCTGA 

GAGACTCACT 

GAAGGCTCTC 

CAGTCTCAAA 

AAATGGATTA 

CTGGGAAGTT 

ATGGAAAATC 

GGAG CTACTA 

CACAATCCTT 

CTCAGGGAAA 

TGCCCTTTCC 

GTTGTATGCA 

GAGAACTTGA 

AGTGGGGCCA 

ATCTGCAAAA 

CCTTATAGTC 

TTGGGTAGAA 

GGAGAAGTCA 

TAATTCCAAA 

AAGGCTTGAG 

CGCTGTGAGT 

ATACTGATCA 

TTCTATATGA 

CACG CCTGTA 

TTTGAGACCA 

CAGGGCGTAG 

GGCTGAGGCA 

CCACTGCACT 

AAAGAAAAGG 

TTGTAGAGAT 

ATTGTGGTCA 

TCTAACACTG 

GATTCCAGGC 

ATCAAAGGCA 

AGAGGGCTAG 

ATATTATACA 

CTCCAAACTG 

TAAGAGTTTT 

TCCTTGAATA 
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CTAACAGAGA 

CATTATAATA 
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TGAAGCAAGA 

AAATCATGAG 

CATAATATTT 

TTTTTAATAT 

ACTTTGGCAC 

CTGACACTGT 

TAAGTACATA 

TTTTGGAAAA 

AGCCTGATTT 
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CTTTCTCGTT 

CAATTCCAGT 
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CTTGATTTAA 
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GATGTTAGCA 

CTGACCTGAT 
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CCCAGAATTC 

AATTATTGCT 

TATTTACAAA 

TTGTAACCCA 
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AAAGGATATT 

GGTCACAGAT 

ACAGTGCAAA 
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GGGTCTACAG 

TTCACCATGT 
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CAGGCTGGAT 

TTCTAGACTT 
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ATTAGGAGAA 

AAAAAAAGGG 

TTTTTTTCTC 

TTTAACTTAA 

AATTTTTTTT 
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TTACCATGAA 
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TAGCATTTGA 

AGAGAATGGG 
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TTTTCATTAT 

TTCTCCTTTA 
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GGGATCTGGT 
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CAGTAAAATG 

ATTAAAAATT 

TCACTTTAGT 

CCCTTTCAAT 

TGTC CTTTCA 

AGTTCTACAT 

GAGTACAGTG 

CCTGACTCAG 

TTTGTATTTT 

ACCTCAAGTG 

TGCACCCCAC 

GACACATAGC 

TTTAAAGTAC 

TCAAAGG CAG 

GATGGGACTA 

GGCTCTGCCA 

CATGTGGGTT 

CTTTACTTGG 

AGTGGGTTTG 

CCAAAGTTCT 

GCTCACTGCA 

CTGGCTCTTA 

ATTGCGATCA 

GAGGGGCCTG 

AAGGTAAGTA 

ATAGATCTTG 

TGGAACTATT 

GTTATTATGA 

ACAGTTTATG 

TACTCAATCC 

TAAATTCTAT 

TTTG CCAAGA 

ATTTTGCTGT 

TGGAGTGCAG 

TTCCTGTCTC 

TTTTTGTATT 

GTGATCCACC 

GGCTCTGCTG 

TCTTTGAGTT 

AGTTAAATAA 

CCAATTAAAA 

TAATTCATGA 



AGGGCACAGG 
GGGAAGCCTG 
GCTGAAGCTT 
ACTCCCAGTG 
CCATGAAATC 
TGGAATAAGT 
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GTGATTCTCC 
AG CTG AATTT 
GATCTCCTGA 
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GAAATAATTA 
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ATATATTGAA 
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ATCCACCTGC 
GTAGTTTTTT 
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TTCTCTATTT 
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TATATTTAAA 
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AATCTTTGTT 
AAATTATCCT 
TTAATGATTT 



GAAACTCCAC 
CCACCAGGAA 
ATTCCGACAC 
TAACATTGGA 
ATCTGCGAGA 
TTTGGGAAAG 
GTGCAGTGGT 
TGCCTCAGCC 
TTGTATTTTT 
CCTCGTGATC 
GCCTGGCCCG 
CACCAAGCGG 
TTTCCCAGAC 
TTATATTTTT 
AAATAAGCAG 
GGTTGTTCCA 
TATCTTCCAG 
TTTTTTTTTG 
GGCTCATTGC 
AGCTGGGATT 
GGGGTTTCAC 
CTTAGCCTCC 

CCAAGTAGAC 

ATGTATTAAA 

ATAAACTCTC 

GCCCAGGGAA 

CATTCACTTC 

ATCATGTGTT 

CCATATTCGC 

TATTTTGGTG 

GCTGTGAGAG 

TTGTTGTCTA 

CTCAAACTTC 

GAACACCACT 

CTTCAATAAC 

TAGGGCTCTT 

GTATGGAAGT 

AC ATTG CTTA 

GGGGAACTGT 

AATGAAAAAC 

TCTGCCTGTT 

GAAATGCAAA 

TTAAAACTTT 

GTTTGTTTTT 

TCAGCTCACT 

GTAG CTGGGA 

ACAGGGTTTC 

TCCCAAAGTG 

GTCTATTTCA 

CCAGTTATAA 

GTATGTTTAT 

TTAAATTATC 

TATTACATTG 
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AGTAGATTCA 
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GTTCAAGTAT 

TCTGTTTTAT 

TATTTATAGT 

TTTATAGCAT 
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GTCATCATGG 
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GAAAAAGTAC 

GCATCTGTGC 

TCAGCTCAGA 

AAAAGTTTAA 

CCATCTTAGC 

G AGAAAT CAG 

TTATGAGCTC 

TTTACAGAGA 

CAGCAGGTGG 

TCAATCACTA 

TTATCATTGC 

AAAATTCCAT 

ACCTCAAGCA 

ATTTGCAAAT 

CACCAAAACC 

AATTTTCAAT 

ACAAAAGTTC 

GGCTCCAGTG 
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ATTTATCCTC 
TTTTTTTTCA 
AAAATCATTA 
TGGTAAAGTT 
GTACAAGTGT 
CTATAGGTCT 
ATGGCATTTC 
GCTCTTTGAG 
TAGCCTCATA 
GTGATAGCAG 
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ATCCATATTC 
GAAATAATAT 
GCCGAGGCAG 
TAAATATCAT 
CC AG CTATTC 



AGATATACTT 
AATTGCAGTT 
TTTTAAAAAA 
AAATATCGAT 
TTATGCATAT 
CTGTGAAATA 
ACTAGTGACA 
GGCAGGGGCT 
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GAAGAATCAG 
TGGGGAAAAT 
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GACTTGTGAT 
CATCTGAAGT 
GTGTCTTCAT 
ACCAAGTTAA 
CTTACCCTGT 
GTGTGTAATT 
AATCCATTCT 
ATAAAAGAAT 
CTTTTAACAT 
TGATTTTGTT 
TCATTAATGA 
TTCTTTTGGG 
GTGCATTGGA 
ACTGCAAACC 
TTGCTGCCTA 
TGCAATGCCA 
ATTTGAGACG 
GGGAACATTT 
GGCATGCCAC 
TTTATCCCTT 
TTTATGATGA 
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AGGGCAGGGT 
ATTAGATCTT 
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TGTGAGTTTA 
GTGTTACCTA 
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CCACGTACAT 

TGGGACCCAT 

TGAGTAGAAT 

TCATGAAACC 

TGGTGTGTGT 
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AAGATCATGA 
AAGTGTCCTA 
TTG CAAGG AG 
ACCCAGGCAG 
CTACATTTCT 
TGTGACTATA 
CAAAGAGAAC 
GAAACAAAAA 
AGACACAAAC 
TGTTGAAATC 
CCATCTCAAC 
CTTTCTTAAG 
ACAACTGGGA 
TGTTTTTTAC 
AGTCAACATG 
GGCCAAAGAG 
ACTGAGCATG 
GCAGTTCTTA 
ACTTGGGACC 
TAATTTTTAG 
AAACTTGAAA 
ATAAAAATTC 



CTGGGCGACA 
TTATCTGGAG 
GTGTGAAAAA 
TGGCTGGTCT 
AAACTTCCTT 
GTACAATTTT 
TCACTGTGCG 
AATCCCTTTT 
GCTCCTCCTT 
AAC CATTATT 
CATCAACACC 
GACCCCTCCA 
TTG C ACT AAT 
AACAAATGTG 
AAACCTTCCC 
ACAGGTGATT 
TATATATATA 
TCCTGTGCCA 
CCCTATTGTT 
TGACTATGGT 
CTGTGTCCTG 
GGTTCCCAGA 
TTTGGCAAAT 
ATGACTTTGC 
GCTTTCTATA 
GGAATCTCAA 
CTCATCAGTC 
TGTTGGAGAT 
TGAGCCTCTA 
ATTTTCTCAA 
AGCACGTATG 
GAAAGACACA 
AGAATATGTT 
GGCCAGTTTA 
GCTTGCTCGG 
GGATGGAAGG 
ATGTGATCAA 
ATGCTGGAAA 
TCATGTTGAA 
AAAACAAAAC 
ACAATGCCAC 
CAAATTTTCA 
TCAGAACCCC 
TAACTACTGC 
GGGCTACTAG 
ATCTAGGATT 
TTTGGCCTGG 
AGGACCCTTA 
AACTTTTTAA 
ACTCTGGCTG 
CACTCCAGAG 
ATACTCCTTA 
AGTCAATTCA 
ATGTGAAAAT 
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GATCACTGAT 

GAATTGTAGC 
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CAGATATTTC 

ATGATGAGAA 

AGAGAAGATA 

TAGACACTTC 

ATGACAGAGA 

ATAACAAGAT 

TACAAGAGAA 

TACTTTCCCA 

AACACTTAAG 

CAACATGTCT 

CAAGAACCTT 

TTTTGAGCAA 

AACATTGATT 

CTAAAAGAAG 

CAGAAAAAAG 
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TAAAAAGATG 

AC CACTATAA 

ACATGTGGAG 

CAATATCTCC 

ACATATACAT 
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CAATGAGAAT 
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GTTGTGGCTG 

CTTGACTTGG 

TGAATGTAGA 
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GATTGATTCC 

ATGGTTTGAG 
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AATTGCTGTT 

AG CC ATTTTG 



AAATATCTTC 

CATATGTTAC 

TTTTTATTAT 

TAAAATTTAC 
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CAACTGGAAC 

TAAAG CTAAA 

C CAT AAAACA 

TTGTCTGTTA 

TAACAGACCC 
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ATCCGTCCTG 

GATGGATGGT 

ATGTGTGTTT 

AAATAAAACA 

TTTTGACCAA 

TTTTTTTTTT 

TCTTAGCTCA 

CAGTAGCTGG 
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CCACCTGAGC 
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TGACAGATGA 

GAAGAGTTAC 

AAACATTTCT 
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ACTTTTTCTC 
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GACTTTGGCT 

ATCTTTTTTG 

TCCTTTTTCC 

GTTATTTGAA 

AGGAGACTTT 



ATGGTGGGGC 

AGATCTCAGC 

TTTTTATACA 

ATGCTAAAAT 

AACAGATATT 
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TGGATAAAAT 

TTTCATACGT 
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AAAAAAGAAT 

CAATATATAA 
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ACTTTGTGAA 
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CATTCACATT 
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CTTGACTGTT 
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GATGTACTGA 

TGTCTGGATA 

CCATTACTCT 

AGCTTGAAAG 

GATAACTTTC 
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TGTTCATAGG 

GAGTAAATAA 

TAGATGAATG 

TAAAGGTTTA 

GTGAGCGATT 

TGCTCCTGGG 

TATTGTACAT 

TCAAAGTATC 

GGGAAGGGGT 

AGTCTCACTC 

TGCCTCCCGG 

ACCTGCCACC 

TTAGCCAGGA 

GCTGGGATTA 

CTTGGTCCTG 

CAATGGGTCA 

TTAATGAGCT 

ATCAGTTGCC 

CTTCTCTGAG 

TCAGACTTTC 

CCTAATGGCT 

CCCTCCCCAT 

CAAATAGTTT 

AAATATTATG 

TGATGTACGA 

TATATGAGAT 

TGGAGCTTAT 

GAAAAAGATT 

CATTGGTTTG 

TCAATTTCCT 



ATGCAGAGAA 

ACTGTAAAGC 

TAGACGTTTA 

ATTTTCAAAT 

AGCCCAATTG 

TTAACTTAGA 

GAATGTGAAG 

ATAATGAAAA 

GGGAGAAAAA 

AGGCATAGAA 

AACCTTCTTC 

CAGGAATTAA 

AACTGATTCT 

CTATTAAAAA 

TAATGCAAAT 

GAGGGTGGTA 

GAAAGGGGGT 

ATAGGTAGTC 

AGACAACCCA 

GTAAGTGGCT 

AATCAAAACT 

GGAAAACAAA 

AACTTTGGAA 

GACATTTTAC 

AGCACTATCT 

ACCACGGTCT 

GGTCTCATAA 

TAAAATACTT 

TCCCTTGTGC 

GTACTAGAAA 

TTATGATTTG 

ATCCTTTTGA 

CTTGGTCCTT 

TGTCGCCCGG 

GTTCATGCCA 

ATGCCCGGCT 

TGGTCTCGAT 

CAGGTGTGAG 

TTCCTCCTCC 

ATCGATGCTG 

CAGGCTTTGA 

ACGGGTGATA 

GTTTTCCCAG 

AGGGTATCTT 

TCAGAATAGA 

CCCTAAAGGA 

ATTGTCATCT 

AATTATATTA 

CAGAATTTGA 

TGGCTCTGTA 

GCTGATTTCA 

GACTAGAATG 

TAAAAATCAT 

TCAGTTACTG 



GATCTGCTCG 

TATAATCCCC 

TTTATGTGAT 

TATTTAGATC 

CAAGAGATTA 

AATCAAGGAC 

GAAGGCAATG 

TTGGAGAGAG 

TATTGAAGAC 

GTGTATCACA 

ATATTTTTCT 

TTTGGCTCCT 

GGTTTTGCCA 

ACTG CTTGGA 

CAATGGGTAT 

CTAAGATGAG 

TTGCAGAAAA 

AAATTATGTA 

ACTAAAAAAT 

CCTTAACATA 

ACAATGAAAT 

ATGTTGCCAG 

AGCTGCTCGG 

TTAGAAATGC 

GTAATAGCCT 

ATTTGTATAG 

GCACAATATT 

TTTAAAAACA 

AGGGATGGGG 

TATTTTATTT 

TGCACGTTTA 

GAGCTTCTGC 

CGGTCCTTTG 

GCTGGAGTGC 

TTCTCCTGCC 

AATTTTTTGT 

CTCCTGACCT 

CCACCGCGCC 

TCTTTTGTTG 

TCAGCTTTTG 

AGAAACCATG 

AGAACAATGA 

CAGCTTCTCT 

TCCTTATGTG 

TG CAATTGTG 

TTGTTTCTAA 

ACCTAATGAT 

ATGTATTTCT 

TTCACTAATA 

TG C AT ACATG 

AAAACAAGAA 

GAATTTTTAT 

GCAGGCTGAA 

GTCTTTTAAG 



Figure 9 (Page 70 of 74) 



SUBSTITUTE cucct/diii c oe\ 



WO 98/14466 



PCT/US97/17658 



159/162 

22 6801 GGGTTTTATA TTTTTCTTTG ATCAATTTTG ACCATTTATG TTATCTTGGA GGATCATCTA 

226861 TTTTACACAC TATTTAAAGT ATATTTGCAA AAATTCAACT GTTTTATCAG GCTATCTTTT 

226921 TAATAATATA TTCATTTTAT CTATATCTGA GGTTTTAGCT TCTTTGTACT TCTGACCCAA 

226981 TTGCATGTGT GCTTTCTTTC TCCTTCATTA GACTACTTAG TCATTTACTA ATTTTAAGAA 

22 7041 TAGCTTGTCT TTTATTTATT TACTTATTTA TTTTTGAGAC GGAGTCTCAC TCTGTCACCC 

227101 AGGCTGGAGT GCAGTGGCGC GATCTCGGCT CACTGCAACC TCCGCCTCCC GGGTTCAAGT 

2 2 7161 GATTCTCCTG CCTCAGACTC CCGAGTAGCT GGGATTACAG TCATGCACCA CCATGTCTGG 

22 7221 CTAATTTCTG TATTTTTAAT AGAGATGGGG TTTTGCCATG TTGG C CAAGC TGGTCTCAAA 

22 7281 CTCCTGACCT TAGATGATCT ACCCACCTTG GCCTCCCAAA GTGCTGGGAT TACAGGCATG 

2 2 7341 AGCCACTGCG CCCAGCCCTG CTTGTCTTTT TATTTTATAT TTGATTAGCT TTATCTTTTA 

22 7401 TCAAGCTTAT GTCCTATTTC CCTTTGCTTT ACTTCATATA AATTTTGTTT TGGATAGTTT 

227461 ATTTATTTTT CATTTAATTA TGAAACAGGT TAAAGCTTAG AGGAAAATTG CTCCTCTAAG 

227521 TCCACTTTTG TGGGCAGATT ACATTTTGCT GTGTTGTGCT CCCAAATTCA TTGTTCTTTT 

22 7581 AATGCTTTAT TTCTCAAGTT AATAACCTAT ATAGTAAAAA AGTGGCTGTT G ACTCTC AG C 

227641 TTTTTTTTTT TTTTTTTTTT TTTTTTTGTA GATACAGGGA TCTTGCTGTG TTGCTCAGGC 

22 7701 TGGTCTGAAA CTCCTGGCTT CAAGGGATCC TCCTGCCTTG GTCTCACAAA ATGCTGGGAT 

22 7761 GACAGACATG AGACACCATG CCCAGCCATG TCTCTCTCCT TATATATAAT AAGAAAACAG 

227821 ACACACTGAG GCATCCTATC ATCTCACTCT TGGTTTCACT ACTGTTCTCT GGAAGTTTTG 

22 7881 CTCTGACCTT TTGCAGTTAA TGTATTAATT TTGCATTGAG TAGTTTCCAT AGAAGAATTA 

22 7 941 TAG CATTTGC ATTCTGTTGG GTATTATACT TTTCACTGTT ATTTGAACAT AATTTGAGGG 

228001 CTGAAACCAA GATGAGGCAA GTGAGGTGCC CAGGAAGCAA TATTTAAGGA GGCATCCTTT 

22 8061 CTTAGGCTCA TGCAAGAACA GAATTGGCAC ATGAGAGTGA GTGCCTCCTT AATTTTGAGT 

22 8121 G CTGG AC ACT TCTTGCTCAC TTAGCATACC CCTGGACAAT GAAGTGTTTT TTGTTTTGTT 

22 8181 TTTTCATGTC CATCCTTTAT CCTTCTTCAT CTCAAAACAT TTCAATGGAG TATTTTTTTG 

228241 GAGCAGTACT TGGATGAGCC TCTGAGTCCC AC AGTAG CTG AGAATTTATT TCATAGTACT 

228301 CTTTATGATC ACTGTGGAGC CTTAAAACAT TGTAATATTA ACTTAGCTGG GAACAGAAAT 

22 8361 TTTGTTCCAC AATTTGTCTT ATTCAGAACA GTATTGACTT CCTGCTAGTC TCTTCTGATG 

22 8421 TCCAATATGA GGAAGTCTAG TTAGCCAGCT ACTTTTTGTA GGAGAGCTAT GTTTAGGCXA 

228481 GGTGCTATAG GATTCTCTTT ATCCTGGAAT TCCTTCACCA AGATGTGCCA AGGTGTTAAT 

228541 CATTTTCTCT TGCTTTTTGG CTGGTGGTCT TAGAGTTTCC TTCGATTTTG TTTTATTTAG 

22 8601 TGATTGTCCT CAATTTGTTT TCTTTACTAA GAATCTCTCT TCTATTTATC TGTATGGTAA 

22 8661 AACCTTGTTG CCCATCTTTC TGGTTTCTGC TGACTTTCAT TTTTGGACCT TTTACTTTGC 

22 8721 TTTCTCCATG GACTTTTTGG TAGTGGAGGC AGGCAAACAC TTTCCAAAGT CTTTCTCAAT 

22 8781 TTCCATCAAT TTCAACTTAT TTCCTAAAAT TGCCTCAGAA TGTGCCTATG TCCACAATAT 

22 8841 CCCTCCTTCC ACTTTAGAAA GGAAAGGCAT CCACACTTTA TTTAGGTGCA ATGCCTGAAG 

22 8901 TGTAAACACT TTCTGGTTGT CAACAAAGGA GTACTTCCAA ATATTGGTTT GGGGATAACC 

22 8961 TGCTAATGAT TAACACATTC ACCTTGGCTC TTGGTTTGCC TGCTCCCTCT TCTTTTATCT 

22 9021 GCTGTGTGTA TTTTTTTTAA TCACTGAGAA TATGCACAGT ATTGTATGTT TTATTATAAG 

229081 AGAGGACTGG CCAGAGTGGG AATGTTCTGA ATTCAGAATA ACTGAAGCAG TACAGGATAG 

22 9141 GAACTCATTC TTTCAAATGA AGCTGGCATA TTTTCCCAGA GCACCAAATT TCAATATATA 

229201 TTTAAAAAAC TTGATATGAA TGATACAATA AAGTGGTTAG AACTTTTATT AAAATAAACT 

229261 TATGTCATGA AATAC TTATT CTAATTATAG TCACTCTTCA TCTTATTTCA TCTTATAACA 

22 9321 TGTTTAATGT TTTCTTTTAT TTACAAAACA ATTTATTTTT TGATGAAAAG TTTTAGAAAT 

22 9381 CAAGTTAAAA ATATTCAAAG GAATGCCTAA AGTTTTCAAA ATTCTTTTAC ATGTTGTACA 

22 9441 ATCAAAAGAG TCTGAAGACC ATTTAGCTAT CCAAATTGTT TATTTTTAAG CAGTAT CCC T 

229501 TCTAATATTT ACTATTTATA ATCCTTAAAA ATTTGCCTTA GCACAGGAGA ATTGCTTGAA 

22 9561 CCCAGGAGAC GGAGGTTGCA GTGAGCCAAC ACAGTG C C AC TGCCCTCCAG CCTCGGCGAC 

22 9621 AGAGTGAGAC TCTGTCTCAA AAAAAAAAAA AAAAAAAAAA AAAAAAGGCC AAAAACAAAT 

22 9681 AAACAAACAA AAAAATCCGC CTTAACATTA TTTGTTCATT AAAAACTTTC TTTAATACTA 

22 9741 CTAGTTTCCC TTTCCTCTCA GCCCATTGTC ATATTTTGAT TTTTATCACT TGCTTTGTAG 

22 9801 GACATATGAG GTTTTTGTTT TTTTTTTTTT TTGGAGATGC AGTCTCCCTC TGTTGCCCGT 

22 9861 GCTGGAGTGC AATGGCGCAA TCTTGGCTCA CTGCAACCTC TGCCTCCTGG GTTCAAGCAA 

229921 TTCTCCTGCC TCAGCCTTCC AAGTAGCTGG GATTACAGGC ACCCACTACC ACGCCTGGCT 

229981 AATTTTTGTA TTTCTGGTAG AGACGGGGTT TCACCATGTT GGCCAGGCTG GTCTCGAACT 
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CCTGACCTCA 

CCACCTGCCC 

TAGGAATTCA 

TTATATTGAT 

GATTTTTGCA 

TTCAAATAAA 

GCCTCAAAAG 

TGTATTTTCC 

AGTAAAGTGC 

CAGACCGTTT 

ACACATAATT 

AAGAAATCTT 
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GGGACTCCTC 

AGGTGATTTC 

ACGGAGCCCA 

AGCTTTCAAC 

GAAGATGATT 

TTAAGAGTGA 
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GGAGCATGAA 

GAACCGATTC 

CATACAGAGG 

AACTGTGTCA 

AAATTGAGTA 

GTTAACTATT 

TCTTTAGTCT 

TATATCCTCG 

CCACTTTTTT 

TGTCCCTGTC 

GTATCACACA 

AGACAAATGC 

GTTATATGGA 

AAGATGACAT 

AGCCCTGACT 

CGGAGTGCTG 

AATAAATAGG 

ACAAGGTAGA 
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TGTGCATGAG 

GCTCCATTTT 

TTAAACACAG 

AAGAGATGAA 

ATGATG CCTG 

TTTACTGCTA 

TAATATATGA 

GCCCCATTGC 

TTTGATTTCA 

GTCTTGTTGG 

TCGTTCTGTC 

TCCCGGGTTC 

ACCACTATGC 

GAAGCACCTA 



AGTGATCCAC 
AGCCAGAATA 
GTTACTTTCT 
TTCTCTTTTT 
CTGTAGTTAA 
TTGAGGTGGG 
GTCTTAGCTG 
CTCTACTCAA 
TCACTCTTTT 
TAGCTTCCAA 
GAGAAAAGAT 
GGAAATAGGT 
TCTACTTAAG 
TTTGTAGCTC 
AGTTAATATG 
TCAGCATTCC 
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CTGCCTCTTT 
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AATCCAGGCC 
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CATAGGTTCC 
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AG AG C CTTCT 
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AGGTTATTAT 
TTTAAATCAC 
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GAAGGGAGGC 
AAAACCCTCT 
TTTGGCACTG 
CTCACAGAAA 
GCATTGCTAT 
TTTTCTGCTA 
ACCCAGGCTG 
AAGCTATTCT 
CCCACTAATT 
GAAACTCTAA 



AATC CTTGGC 

TATGTTCATT 

TGAGAAAATC 

CATATTGAGA 

AGAAACCACC 

GTTACTCTGA 

TAGCAACTTG 

CATTTAAGGT 

GCTTTAACAA 

AGGGAGTTCA 

AGTTCCACCA 

TTATATAAAA 

CCACCCATTT 

CAAGTGCCAC 

ATCAATTATT 

CTGCAGGGAA 

TCACTTTCAG 

TAATATGTGA 

GTGGTCCAGC 

TTGGTGCCAT 

CAGTGGACAG 

AATCTGGCAC 

CTGTTTAGCC 

ACGGGCCTTT 

AAATGGTGGC 

TCTTTTGCAG 

AATTTTTAAT 

GCTCTCCATG 

GGATTATTCC 

GGTGAGATGC 

ATAGCCTAAG 

GCCAGGCACC 

CATGTGAAAG 

GCTAAAAAGA 

GCCCATGTAT 

GACTTTGAGA 

TAGTGTGTAT 

TCCAAACTTA 

TTGACATTTA 

AACTGCGTGC 

GGGAGACAGC 
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CCCCTGCACC 

TCTTTGGATC 

AGTCTGTCAC 

GAATTTCATA 

TTTTTCTCTT 

ACTCCTGCTT 
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CCTGCCTCAG 

TTTGTATTTT 

TTCTTTGTAG 



CTCCCAAAGT 

TTGAGTCCTT 

TCTGAAAAGA 

ATTGTTTTTT 

TGTGTGTTGG 

GAATCAAAGG 
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CTCAGAAGAT 
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CTCCTCAGAT 
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CATCATTATT 
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GCTATGATTA 
TAACAAAGTC 
TGCCAATAAT 
AAAAAGTTTG 
TTAAGCCATA 
AAAACCTGAA 
GAAATAAATA 
AATATAATTG 
GCTGGTAGGC 
ATTCACGACC 
AAATG CTG AC 
CCTTTTTTAT 
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TAGGACCTGA 
GGCTCTAATG 
TTTTATCAAC 
TAGTTGCTTT 
TCAGAAAGTG 
CCACATCTAA 
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TACAACTTTG 
AATTTTTGTT 
GTGGCATACG 
ATGAACATAG 
TTTACAACTA 
TATTCATATA 
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TTGTTAATAT 
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TACCTCCACT 
CTAGGGATAC 
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TTGGTTTTAT 
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AAGAGAGCTT 
TAGGTAAAAG 
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TCCCACTCAT 
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GTTATGGTCC 
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TGCAGAGGGA 

TTGAACAGCT 

TTTGAAATCA 
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GGAGTTCTTG 
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ACAGGAACTA 

AGGTACAATG 

TTTCTTCAAA 

TCAAGTTTAG 

TTGCTTTTAT 

ATACTTTGAG 

ATACGTGTGT 

C C AAAT AT AT 

AGCAGTAAAC 

TAAAGTG CAT 

ACTCATTGAA 

TTCTCTGGAG 

TCACACTATT 

GTGGATGAAT 

GGCTCCTCTT 

AGACTAATTA 

GCTCGCCAAG 

CTTGCAGGTT 

AGAGCCCTAG 

TGACCACAAC 

CCGACTGTTC 

TACTGTCTGC 

TTTTTTTTTC 
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TTCATTATAC 

TCCTTCCCAT 

AGACGGAGTC 

AACCTCCGCC 

ACAGGCGCTC 

CACATGTCCA 

TTCCTCTAAT 
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AGTATAATTT 
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TAATCTAAAA 
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CAGTATAATG 
AGGGATACTC 
TCTTCAAGGT 
TTGGATTTTC 
TCTTTCACCT 
TATAAGCAAC 
TTTCACCACT 
ACTGCATTTT 
TTCTTGCTTT 
CCGGAGTTAT 
ATTTTAGGGA 
TAGGTAATGT 
TGAACTTATC 
CATTGGGGCC 
AATGTTTGAT 
ATTTTTAGTT 
TTGTGGTGTT 
CTTAGTTGGC 
GTGTCTATCT 
TGTCAACTTG 
GTTTGTGAGC 
CTGTCTTTCC 
AAGGCAGAGG 
CTCATCTGGT 



AATCCCTGAT 
CCTGGAAAGC 
AAGAATCAGT 
TATCTGAATG 
TTGCTGCCCA 
GGTTCAAGTG 
CACACCTGGC 
TTCCTCGAAC 
ACAGGTGTGA 
AGTCTGACAC 
CAGATTTCCT 
GCCTCAAGTT 
CTGAAACAGG 
CATTTTTCCT 
TGTTAATTTA 
TTAAAAAATT 
AAATGAGGAA 
TGTTAGCTTT 
ATGCTCATAA 
AATCCCAGCG 
GCCTGGGCAA 
GTGGTGCATG 
CAGGAGTTTG 
GTGAGAACTT 
ACTTTTGTTT 
AGTACTTAAA 
TTATCCAGAA 
ATTAGCTCCC 
ATTGGAAATT 
GGAGTGCTCA 
CAAACATTCC 
AACGTGTGTT 
AACCTCTATC 
AGGAAAGTTG 
AGCTTTCCCC 
TCACATTGAT 
GTTTCCACTA 
CTTGTCATAT 
TCATGACCTT 
AGATTTTTTG 
GAACATGATA 
TTCAGGTTTC 
AATTTTGTTT 
AAATCTTAGA 
ACATTCTAAA 
ACTTTTGTAT 
CCAGTACTAT 
AATATTTTTG 
TTTTGACAAA 
ACTGAGTCAG 
GTGTTTCTGG 
CAGTGTGGAT 
AAGGGGGAAT 
CTCCTGCTCT 



TCCCAAACAC 
TCTTACACAA 
CACCCAACAG 
TAATCTCCCA 
GGCTGGAGTG 
ATTCTCCTGC 
TAATTTTTGT 
TCCCAAACTC 
GCCACCATGT 
AAAGTGGATA 
GGTGCTCTCA 
TTTATCTGTA 
AAATACATAT 
GTTTCCCCAA 
ATTTAAAAGA 
ATCTCTGGAA 
CTAGAGAGCA 
TTATTATTTT 
TAATAAGTTT 
CTTTGGGAAG 
CATAGTGAGA 
CTTGTACTCC 
AGGCTGCAGT 
GTCTCAAAAA 
CAAAATATGT 
AGTTAATAAC 
AATCATCCAT 
AGGTAATTAG 
CAAAATGCTC 
TGGAGTATTT 
AAATCTGAAA 
AGCTAATTAG 
CTCACTTCTA 
CAAAGATAGT 
CATTGTTAGG 
ACATGAAACT 
ATGTTTTCTT 
CTCCCTAGTC 
AACAGTCCTG 
AAATAATACC 
TCCACATGAC 
TCTACTGCAA 
TCTTCCATGA 
TCATGTAAAT 
AGATGTAATG 
AAGGTGTGAG 
TTGTTGCTAA 
TTGGTTTATT 
ACTGTTGATT 
GGGATAACCA 
ATGAGATTAG 
GGCATTATGC 
TTGGGCCTTT 
TGAACTGGGA 



GGTCTTTTCA 
ACACGCCCTC 
TGTCCTTGTC 
GAGGGTGTTA 
CAGTGGCATG 
CTCAGCCTCC 
ATTTTTAGTA 
AGGTGATCCA 
CCAGCCCCAT 
TAACAATATT 
AAGTTTTATG 
CTATGATTTC 
AACTGAAAAA 
CTTCATTTTC 
AGTAGTCTAC 
GGATACACAG 
TGGC CAAGTG 
CTTTTGTAGG 
AAAATAAAAC 
CAGAGGTGGG 
CCCTGCCTCT 
TAGCTACTTG 
GAGCTATAAT 
AAAAAGGGGG 
AATATTTAGC 
TATTGTCTCC 
ATCAGCAAGC 
CAGGCAGCCT 
CAAAATCTGC 
CAGATTTTGG 
AAATCTGAAA 
ACCCTTCATG 
ATAGCATGAA 
ACAAAGACAG 
ATTTTACATT 
CTATTAACCA 
TCTGTTCCAA 
TTTTTTTGTC 
AAGATCATTT 
ACAAGGGCAA 
ATCACTGATA 
AGTGATTTTT 
CTAATACTTT 
TTTCTTCTAT 
TTTGATACAT 
AGATGTCTCC 
GACTATCTTT 
TCTAGACTGT 
ACAGTAAGCT 
GCTATCTGGT 
CCTTTGAATA 
CACCTGATAT 
TTTTCTGCCT 
TTTACAT CAT 



TATACATTTT 
CCCTAGGAAG 
ACATCTTAGG 
TCATCTTTTT 
ATCTCGGCTC 
TGAGTAGCTG 
GAGACAGGGT 
CCCACCTCAG 
CTTTTTCTTT 
TTGAATTATG 
TTACAAAAGA 
AAACCAAATA 
TTTTGGTATG 
TATAGCAATA 
CATCTCTTCT 
GGAACATTGC 
GGGTTTTGCT 
TTTGAATTTC 
TTTTGGCTGG 
AGGATACTTG 
GTAGAAATAA 
GGAGGTTGAG 
CACCCACTGC 
GGGGGAAACA 
ACTAAAGAAT 
TTTAAAAGAA 
TAAACTTTCT 
CTACTCAGGT 
AACTTTTTGA 
ATTTTTGGAT 
TACTTCTGGT 
GTCTCTTCTA 
CTTTTCTGTT 
TACAGGAGAG 
ATTATGATAC 
AACCCTAGAC 
GGTCCAATCT 
TGTGACAATG 
GCTTTTTTTT 
AGGGCCCTTC 
TTAACCTTCA 
TTCCCTTAAT 
TGTTATTATA 
ATTTTATTCT 
TACATCTAGT 
AGTTTCACTT 
TTTCCATTGA 
TTATCTCATT 
TTGAAATAGT 
TAAACATTAT 
GGTGATCCTA 
TCAGGGTCTG 
CACTGCTTGA 
CAGTTCCTCT 



CCACTGTACA 
CCTTTATAAA 
TTCTACACCT 
TTTTGAGATG 
ACAGCAACCT 
GGATTACAGA 
TTCACCGTGT 
CCTCCCAAAG 
TAGTTTAGTT 
AATAACTAAA 
AAAACAAGTC 
AAAAACAGGT 
TTAGTATGAT 
AAAAGAAACA 
GTTAAAAAGA 
TCTGGTTTCT 
TTTGTTTTTG 
AAAC C AC ATA 
GTGCAATGAC 
AGGCCAGGAA 
ACAAAAATTA 
GCAGGAGGAT 
ACTATAGCAT 
AATAAATAAA 
TCTGAATTGT 
TTGTTATCAA 
CAAAATGACA 
TGAGTATTCC 
ATGCTAACAT 
TTGAGATACT 
TCTAAGCATA 
GACCTCAGCT 
TTAGAATAAT 
TTCCCATATA 
ATTTGTCAAA 
TTTATGTGGA 
GGAATACCAC 
TCTCAGTCTT 
CATAATTACA 
TTGTCACATC 
TCATGTGGTT 
TTAGCCCACC 
GCTAAAACTT 
AAAAG CTTGT 
CCTTTGATTT 
TATTAACACA 
TTACCTTTGC 
CCACTGATTT 
TCATTTTTTG 
TTCTGGCTGT 
GTAAAGTAAA 
AATAGAAGAA 
GCTGGGACAT 
GGTTCTCAGG 
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AGATCATGGG 
AACTGCCCCA 
CAGATTTCCC 
AGTGCAGGGG 
GAAAGCTCTA 
ACATTACATT 
AGTCATTCTT 
AGAAGTACTT 
AAAAAGTTAA 
ATTTAGGAAT 
AATGG AG CAT 
AAAAGGAGCT 
TGTGTGTGTG 



CAGACTGAAT 
ACTCCTCATC 
CTGCAGATTA 
TTCATCCAGT 
TGGGGATGAG 
GAAGTGTTTG 
TTTCCCAGAA 
CCTGATTATC 
TTGGAACACA 
TGAAAAACTA 
TTGCCTTACC 
AGAATAAGAG 
ATAAAGCCTT 
TGTGTGTGTG 



CATACCACCA 
TTCCATAAAT 
AGGCTTTTTT 
GCCCTCTCCT 
GGTATAGTCC 
ATACATACAT 
AAAAAGGAAT 
AAAGGTAAAC 
AGGAATTCTC 
TAGTACCTTC 
AAGTAAAACA 
TAGTAAAGAA 
TAGGTATTTT 
TGTGTG 



GCTTTCCTGG 
GCATGAG CCA 
CCACTAGGTG 
CTTTAAGTTA 
TCTTGTTTGC 
AAACAAGGCA 
GTATAGGCAT 
AGTTATTAAT 
TGGGAGTCCT 
CTATAAGCTG 
TAAGGGCAGC 
TGCCAAAAAT 
CACACTTGCT 



GTCTCCAGCT 
ATTCAGTCTA 
AAATAAAGAA 
CAACACATTG 
TGAGAAGAGA 
TGGTTTTTGC 
CACGTAACTG 
CCTATACCAA 
TACTACTCTC 
GATGACTAAT 
TGAGGTGCTG 
GCTGTCATGT 
CTGTTACGTA 



TGCAGATTAC 
TGTCCTTGAA 
GCTTGTTAGA 
GCTACACCTA 
ACTGTATTGG 
ACTTAATTTC 
TACTAGCTGG 
GATGTCAAGG 
AAGCCCAGTG 
TACCAGGCTC 
ACTGAAGACA 
ATCCATTGAC 
AATGTATGTG 



Figure 9 (Page 74 of 74) 

SUBSTITUTE CUCCT/OIII p OR\ 



INTERNATIONAL SEARCH REPORT 


Internationa! application No. 
PCT/US97/ 17658 


A. CLASSIFICATION OF SUBJECT MATTER 
IPC(6) : C07H 21/04; C12Q 1/68; C12N 15/63, 15/85; C12P 21/02 
j US CL : 536/23.5; 435/6, 70.1, 325, 320.1 
According to Internationa! Patent Classification (IPC) or to both national classification and IPC 


B. FIELDS SEARCHED 


Minimum documentation searched (classification system followed by classification symbols) 
U.S. : 536/23.5; 435/6, 70.1, 325, 320.1 


Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 


Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 
APS, DIALOG'S BIOTECH cluster. 
1 hemochromatosis, BTF1, BTF2, BTF3, BTF4. NTP-3, NTP-4. RoRet, butyrophilin, type I sodium transport 


C. DOC 


:UMENTS CONSIDERED TO BE RELEVANT 


Category* 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


A, P 

X 

X 


RUDDY, D.A. et al. A 1.1-Mb transcript map of the hereditary 
hemochromatosis locus. Genome Research. May 1997, Vol. 7, No. 
5, pages 441-456, see entire document. 

FISCHER, L. et al. Cloning of the 62-kilodalton component of 
basic transcription factor BTF2. Science. 04 September 1992, Vol. 
257, pages 1392-1395, see entire document. 

MARGOTTIN, F. et al. Participation of the TATA factor in 
transcription of the yeast U6 gene by RNA polymerase C. Science. 
25 January 1991, Vol. 251, pages 424-426, see entire document. 


1-20, 22-77 
28-33, 71 
22-27, 70 


|_x| Further documents are listed in the continuation of Box C. (^] See patent family annex. 




Special calegones of cicd document*: "T" later document publi.hed after the international filing date or priority 

-A" document defining the general state of the art which is not con.idered J? U T ' m ~ nnicl "j* a PP ,icaUon bul cited 10 understand 
to be of particular relevance wniiuerea principle or theory underlying the invention 

*B" earlier document published on or after the international filing date " X " document of particular relevance; the claimed invention cannot be 
. considered novel or cannot be considered to involve an inventive step 
document which may throw doubts on priority claim(s) or which is when the document is taken alone 
cited to establish the publication dnie of another citation or other 

special reason (as specified) 'V document of particular relevance; the claimed invention cannot be 

-11- Hnrun, r„ . ... ■ ■ . considered to involve an inventive step when the document is 

meo„» iU%cloMrc ' usc - "h.b.tion or oiher combined with one or more other such documents, such combination 
' " " J " lS being obvious to a person skilled in the art 

"P* document published prior 10 die international filing date but later than • 1 u r t 

die priority date claimed & document member of the same patent family 


Date of the actual completion of the international search 
20 JANUARY 1998 


Date of mailing of the international search report 

\ 2 FEB m 


Name and mailing address of the ISA/US 
Commissioner of Paienls and Trademarks 
Box PCT 

Washington. D.C. 2023 1 


Authorized officer J 

F. PIERRE VANDERVEGT /T/^^— 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US97/ 17658 



C (Continuation). DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



X 
A 
A 



ZHENG, X.M et al. Sequencing and expression of complementary 
DNA for the general transcription factor BTF3. Nature. 05 April 
1990, Vol. 344, pages 556-559, see entire document. 

PANTEGHINI, M. Electrophoretic fractionation of 5'-nucleotidase. 
Clinical Chemistry. February 1994, Vol. 40, No. 2, pages 190-196, 
see entire document. 

BURT, M. J. et al. A 4.5-megabase YAC Contig and physical 
map over the hemochromatosis gene region. Genomics. 15 April 
1996, Vol. 33, No. 2, pages 153-158, see entire document. 

VERNET, C. et al. Evolutionary study of multigenic families 
mapping close to the human MHC Class I region. J. Mol. Evol. 
November 1993, Vol. 37, No. 6, pages 600-612, see abstract in 
particular. 



34-39, 72 
52-57, 75 
1-6 

7-20, 22-77 
1-20, 22-77 



Form PCT/ISA/2I0 (continuation of second shect)(July 1992)* 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US97/ 17658 



Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This international report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 
1. £H Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



2. I I Claims Nos.: 

because they relate to parts of the international application that do not comply with the prescribed requirements to such 
an extent that no meaningful international search can be carried out, specifically: 



3. Q Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 
Please See Extra Sheet. 



| x| As aU required additional search fees were timely paid by the applicant, this international search report covers all searchable 



claims 



1 1 As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 

3 ' ! 1 As on 'y some °f lne required additional search fees were timely paid by the applicant, this international search report covers 
only those claims for which fees were paid, specifically claims Nos.: 



| 1 No required additional search fees were timely paid by the applicant. Consequently, this international search report 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest | ] The additional search fees were accompanied by the applicant's protest. 

[ Xj No protest accompanied the payment of additional search fees. 



Form PCT/ISA/210 (continuation of first sheet(l))(July 1992)* 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US97/17658 



BOX II. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA found multiple inventions as follows: 

This application contains the following inventions or groups of inventions which are not so linked as to form a single 
uiventive concept under PCT Rule 13.1. In order for all inventions to be searched, the appropriate additional search 
tees must be paid. 

Group I. claim(s)l-20, drawn to polynucleotide sequences containing at least one polymorphic site, polypeptides 
encoded thereby, antibodies to said polypeptides and a method to determine the presence of the HFE gene mutation. 

Group II, claim 21, drawn to the lymphoblastoid line atcc crl-12371. 

Group III, claim(s) 22-27 and 70, drawn to BTP1 nucleic acids, gene products, vectors and antibodies. 
Group IV, claim(s)28-33 and 71, drawn to BTF2 nucleic acids, gene products, vectors and antibodies. 
Group V, claim(s) 34-39 and 72. drawn to BTF3 nucleic acids, gene products, vectors and antibodies. 
Group VI. claim(s) 40-45 and 73, drawn to BTF4 nucleic acids, gene products, vectors and antibodies. 
Group VII, claim(s) 46-51 and 74, drawn to BTF5 nucleic acids, gene products, vectors and antibodies. 
Group VIII, claim(s) 52-57 and 75, drawn to NPT3 nucleic acids, gene products, vectors and antibodies. 
Group IX. claim(s) 58-63 and 76. drawn to NPT4 nucleic acids, gene products, vectors and antibodies. 
Group X, claim(s) 64-69 and 77, drawn to RoRet nucleic acids, gene products, vectors and antibodies. 

Utr^T,^ V ?T PS IX d ° " 0t M 10 a 8ingle invendvc undcr PCT Rule 13 * beca^. under 

m v V they la ck me sam c or corresponding special technical features for the following reasons: Groups I and 

Thl ivTn^r, 10 / ^ y ? fferent 8enea and * eir gcne P^"^ and «ch therefore constitutes a separate invention. 
The lymphoblastoid ceU line of Group II is not dependent upon the vectors of any of the Groups I and ihoc and 
toerefore constitutes a separate invention. Accordingly, the claims are not so linked by a special technical feature 
within the meaning of PCT Rule 13.2 so as to form a single inventive concept. 



Form PCT/ISA/210 (extra sheet)(July 1992)* 



